ABSTRACT


In  this  thesis,  equations  are  derived  for  the  state  estimates 
and  error-covariances  of  discrete  and  continuous  nonlinear  systems  with 
and  without  delays,  corrupted  by  white  noise  as  well  as  non-white  noise 
processes.

In  the  case  of  continuous  systems,  new  filtering  algorithms 
are  derived  for  nonlinear  systems  without  delays,  imbedded  in  white 
noise,  correlated  noise  and  noise  free  processes.  The  results  obtained 
for  white  noise  sequences  are  exact  and  optimal,  with  respect  to  the 
constraints  imposed  on  the  filtering  dynamic  equations,  minimizing  the 
error-variance  cost  functionals.  The  main  technique  makes  use  of  the 
matrix  minimum  principle  together  with  the  Kolmogorov  and  Kushner 
equations  to  derive  the  optimal  values  of  the  coefficients  in  the 
estimation  algorithms  under  the  requirements  that  the  estimates  be 
unbiased.  For  non-white  noise  processes  the  results  obtained  are 
suboptimal  since  an  approximate  assumption  has  been  made. 

The algorithms are recursive and can be implemented on a digital
computer under the assumption that the conditional probability density
functions of the estimator errors are Gaussian. Various nonlinear
systems were simulated, and the results were compared with those
obtained from some widely used finite-dimensional approximate nonlinear
filters. The results clearly indicate the superiority of the proposed
minimum variance filter. Results pertaining to linear problems can
be easily deduced from the nonlinear estimation algorithms; they agree
well with those derived in the literature using other optimization
techniques.

In the case of discrete-time systems, new nonlinear estimation
algorithms that directly yield the fixed-lag, fixed-point and
fixed-interval smoothing algorithms and the filtering algorithms are
derived for nonlinear delayed systems with non-delayed measurements and
multichannel time-delayed measurements, corrupted by white noise,
correlated noise and colored noise processes. The derivation makes use
of the concept of the gradient matrix to minimize the error variance,
taken to be the estimation criterion, under the condition that the
estimates be unbiased. The derivation is straightforward and clearly
indicates the close link between the three classes of smoothers and the
filtering estimator.

For systems with product-type and polynomial nonlinearities,
the only assumption needed to implement the algorithms on a computer
is that the conditional probability density functions of the estimator
errors are Gaussian. The algorithms are expected to be computationally
efficient, since no augmentation of the state variables is introduced.
The results obtained are exact and optimal with respect to the
constraints imposed on the dynamic equations of the estimators,
minimizing the error-variance cost functionals.

The  results  can  also  be  applied  to  various  special  cases 
of  nonlinear  as  well  as  linear  systems  with  and  without  delays.  For 
linear  systems,  the  results  are  identified  with  those  in  the  literature. 

CHAPTER I


GENERAL  INTRODUCTION 

1.1 INTRODUCTION

Physical systems are designed and built to perform certain
defined functions. In order to determine whether a system is performing
properly, and ultimately to control its performance, the engineer
needs to know the state of the system. To determine this state, the
engineer must take measurements or make observations on the system.
These measurements are generally contaminated with noise caused by the
electronic and mechanical components of the measuring device.

The  problem  of  determining  the  state  of  a  system  from  noisy 
measurements  is  called  estimation  and  is  the  main  subject  of  this 
research  project.  In  general,  it  includes  the  problems  of  filtering, 
prediction  and  smoothing. 

In many applications, it is meaningful to assign a cost
function to an estimate, representing a quantitative measure of how
good the estimate is. With a knowledge of the cost function, the system
states, and parameters, one can determine the best control, which
minimizes (or maximizes) the cost function.

The problem delineated above is a rather old problem, dating
back almost two centuries to Legendre (1806) and Gauss (1809). Gauss
was interested in determining the orbital elements of a celestial body
from observations and developed the technique that is known today as
least-squares estimation.

In the early 1940's, Wiener [1] and Kolmogorov [2] discussed
problems of linear least-squares estimation for stochastic processes
corrupted by additive noise. Their solution depended upon the
assumptions of stationarity and ergodicity, and on knowledge of the
entire past of the observed processes. In actual practice, however,
the signal and noise processes may not be stationary, nor may the
observation time always start at $t = -\infty$.

Since  then,  numerous  attempts  have  been  made  to  remove  one 
or  both  of  the  above  restrictions.  The  notable  work  by  Kalman  [3,4] 
and  Kalman  and  Bucy  [5]  resulted  in  by  far  the  most  successful 
solution  to  the  problem.  They  considered  the  nonstationary  linear 
system  from  the  state  space  point  of  view  with  measurements  corrupted 
by  white  Gaussian  noise  processes,  and  obtained  recursive  solutions 
for  both  the  continuous  and  the  discrete  linear  cases.  The  original 
derivation  was  based  upon  the  derivation  of  the  Wiener-Hopf  equation, 
using  the  Orthogonal  Projection  Lemma  [3].  This  theory  has  become 
known as Kalman-Bucy filtering and, because of the obvious computational
advantages of the recursive algorithms, it has found numerous applications
in missile guidance, space-vehicle navigation, state estimation for
state-vector control, plant identification for adaptive process control,
and orbit determination.

After the appearance of Bucy's and Kalman's work, the
study of linear estimation problems was further generalized to
non-white noise problems, that is, problems with colored noise or no
noise in one or more of the measurements. One of the first, and in many
ways one of the most complete, treatments of colored noise in continuous
systems was presented by Bryson and Johansen [6]. The discrete colored
noise problem was examined by Bryson and Henrikson [7]. More recently,
Stubberud [8], Stear and Stubberud [9], and Sarachik [10] considered
the problem for continuous systems in which each measurement has only
colored noise.

Judging by the literature, recursive solutions for the
smoothing problem have been thought to be harder to obtain than recursive
filtering solutions. Methods have been proposed by Bryson and
Frazier [11], utilizing the calculus of variations; Rauch, Tung and
Striebel [12], adopting the maximum likelihood method; Meditch [13],
using orthogonal projection; and, most recently, Kailath [14] and Kailath
and Frost [15], making use of the innovations.

However, dynamic message models and measurement models for
the majority of realistic control problems are inherently nonlinear.
Most of the work in nonlinear filtering is very theoretical; some of it
offers no more than a philosophy of approach rather than a procedure
leading to the derivation of practical estimation algorithms.


One of the main lines of attack on the nonlinear estimation
problem is the probability approach pioneered by Stratonovich [16],
and subsequently taken up by Kushner [17, 18], Wonham [19], Bucy [20],
and Mortensen [21]. The truly optimal nonlinear filters for systems
corrupted with additive white noise were given by Kushner [17, 22];
however, their exact solutions require infinite-dimensional systems,
which are impossible to realize except in trivially simple cases.

The problem of obtaining a good approximation to the exact
solutions of the nonlinear estimation problems is, therefore, very
important. Extensive work has been carried out to approximate the
optimal nonlinear filters by suboptimal finite-dimensional filters.
Among others, Cox [23] used the dynamic programming method, Detchmendy
and Sridhar [24] used the technique of invariant imbedding, Friedland
and Bernstein [25] used variational techniques, and Bass et al.
[26] assumed that third and higher order central moments are negligible.
As a result, the conditional mean and covariance equations are
approximated by either first-order or second-order finite-dimensional
filters.

The majority of the suboptimal finite-dimensional filters
introduced in the literature involve a Taylor series expansion of
the nonlinear functions, but not all nonlinear functions can be expanded
in a Taylor series. Moreover, for systems with highly pronounced
nonlinearities, Kushner [27] pointed out that the first-order filter
cannot estimate the states of the Van der Pol oscillator. Schwartz and
Stear [28] concluded, on the basis of some simulation results, that the
added computational complexity may not improve the performance of
the nonlinear filter.

Recently,  Sunahara  [29]  proposed  to  replace  the  nonlinear 
functions  by  quasi-linear  functions  via  stochastic  linearization,  but 
the  resulting  filters  are  yet  to  be  tested  and  compared  to  all  other 
approximate  filters. 

Sage  and  Ewing  [30,  31]  and  Sage  [32]  utilized  the  methods 
of  maximum  likelihood  and  invariant  imbedding  to  derive  approximate 
filtering  as  well  as  smoothing  algorithms  for  nonlinear  continuous  and 
discrete  systems.  The  continuous-time  algorithms  follow  from  the 
discrete-time  algorithms  via  the  Kalman  formal  limiting  argument  [4]. 

Other works on nonlinear smoothing were presented by
Lainiotis [33], Lee [34] and Leondes et al. [35]. In particular,
Leondes et al. derived the exact functional differential equations for
the smoothing density functions and the smoothed estimates; but their
solutions are prohibitive except in trivially simple cases, and hence,
approximations are developed for sequential nonlinear smoothing. An
iterative technique is also suggested for cases where the nonlinearities
may be severe.

However, the above nonlinear estimation results are only
concerned with nonlinear systems corrupted by additive white noise.
To the author's knowledge, optimal nonlinear filtering and
smoothing algorithms for non-white noise processes have not been derived.

As far as state estimation problems for systems with time
delays are concerned, Kwakernaak [36] used the method of orthogonal
projection to derive filtering equations for linear continuous systems
with multiple time delays which, when solved, also yield smoothing
estimates. Priemer and Vacroux [37, 38] later considered the estimation
problems in linear discrete systems containing multiple delays in
the message model. Farooq and Mahalanabis [39] rederived the estimation
algorithms of Priemer and Vacroux using the state augmentation
technique; however, it was pointed out in [37] that the augmentation
of state vectors increases the dimension of the system, and thus leads
to a filter that is computationally inefficient.
Recently, Biswas and Mahalanabis [40] presented smoothing results for
continuous systems with time delays by first discretizing the
continuous systems and then employing the state augmentation technique.

To the author's knowledge, nonlinear filtering and smoothing
algorithms for nonlinear time-delayed systems corrupted by additive
white noise as well as non-white noise processes have not been derived
in the literature.

1.2 SCOPE OF THE THESIS

In this thesis, exact and approximate, discrete and continuous
state estimation algorithms are derived for nonlinear systems, with
and without delays in the message models and measurement models,
involving white Gaussian noise, correlated noise and colored noise
processes.

In Chapter II, some basic concepts of state estimation
theory and mathematical preliminaries essential to the work of this
thesis are presented.

Chapter III deals with continuous-time nonlinear systems
without delays corrupted by additive white noise as well as non-white
noise processes. In Section 3.2, the noise processes are assumed to
be Gaussian and white. The basic approach makes use of the matrix
minimum principle [44] together with the Kolmogorov [2] and Kushner
[17, 18] equations to minimize the error variance, taken to be the
estimation criterion. However, the exact algorithms derived in such
a manner require infinite-dimensional systems for their realization,
which is computationally impossible. In order that the estimation
algorithms can be physically realized, it is assumed that the conditional
probability density functions of the estimator errors are Gaussian.
An example is included to show how one can evaluate the proposed
algorithms under the above assumption.

For  the  purpose  of  assessing  the  performance  of  the  proposed 
minimum  variance  filter  and  to  compare  it  with  various  other  approximate 
finite  dimensional  filters,  in  Section  3.3  three  different  types  of 
nonlinear  systems  are  selected  and  simulated  on  the  digital  computer. 
Discussions  are  included  to  explain  the  apparent  discrepancies  amongst 
the  proposed  minimum  variance  filter  and  all  other  approximate  finite 
dimensional  filters. 

In practical situations, nonlinear dynamic systems are
imbedded in non-white noise processes. Therefore, in Sections 3.4 and
3.5, we turn our attention to more general nonlinear estimation
problems, namely, those involving correlated noise and noise-free
processes, respectively. The colored noise problem can be considered
as a special case of the noise-free estimation problem.

In Chapter IV, discrete-time filtering and smoothing
algorithms are derived for nonlinear time-delayed systems imbedded in
white Gaussian noise processes. In Section 4.2, the measurement
models do not involve any time delays, whereas in Section 4.3 the
measurements can be considered as the sum of signals propagated with
different delays. The main technique makes use of the matrix minimum
principle to derive the optimal values of the coefficients in the
estimation algorithms under the requirement that the estimates be
unbiased. The resulting algorithms can be recursively evaluated under
the assumption that the probability density functions of the estimator
errors are Gaussian. Examples are included to illustrate the use
of the proposed estimation algorithms; in particular, they provide better
insight into how one can properly substitute for the discrete-time
indices in order to arrive at the filtering, fixed-interval smoothing,
fixed-point smoothing and fixed-lag smoothing algorithms. The
linear estimation problems are considered solved, since they can be
treated as a special case of the nonlinear estimation problems.

Chapters V and VI deal with discrete nonlinear time-delayed
systems involving, respectively, correlated noise and colored noise
processes. The derivation as well as the presentation follow those of
Chapter IV. As in Chapter IV, the linear estimation
algorithms for these problems can be obtained from the nonlinear
estimation algorithms.

Chapter VII summarizes most of the results presented in
Chapters III to VI. Also outlined are some recommendations for further
research.

CHAPTER II


BASIC  CONCEPTS  AND  MATHEMATICAL  PRELIMINARIES 

2.1  BASIC  CONCEPTS 

Expectation: If $x_i$ is a discrete random variable, the expectation
of the function $h(x_i)$ is defined by

$$E\{h(x_i)\} = \sum_i h(x_i)\, f(x_i)$$

where $E$ denotes the expectation operator, $f(x_i)$ is the density function
associated with the random variable and the $x_i$ are the values of the
discrete random variable for which $f(x_i)$ is defined.

In the same way, if $x$ is a continuous random variable, the
expectation of the function $h(x)$ is defined by

$$E\{h(x)\} = \int_{-\infty}^{\infty} h(x)\, f(x)\, dx$$

where $f(x)$ is the density function associated with the random variable $x$.

The expectation operator $E\{\cdot\}$ has a number of rather obvious
properties:

1. $E\{h_1(x) + h_2(x)\} = E\{h_1(x)\} + E\{h_2(x)\}$

2. $E\{C\, h(x)\} = C\, E\{h(x)\}$

3. $E\{C\} = C$, where $C$ is a constant.

The expectation is also called the mean, the average and the
first moment.

Variance: The variance is defined for the discrete case and for the
density function $f(x_i)$ as

$$V_x = \sum_i [x_i - E(x_i)]^2 f(x_i)$$

and for the continuous case

$$V_x = \int_{-\infty}^{\infty} [x - E(x)]^2 f(x)\, dx$$

The variance is also called the second central moment.
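
The two discrete definitions above can be checked on a small example. The following is a minimal sketch, assuming a fair six-sided die as the random variable; the function names `expectation` and `variance` are illustrative and do not appear in the thesis. Exact rational arithmetic is used so that the values can be compared without rounding.

```python
from fractions import Fraction


def expectation(values, probs, h=lambda x: x):
    """E{h(x_i)} = sum_i h(x_i) f(x_i) for a discrete random variable."""
    return sum(h(x) * p for x, p in zip(values, probs))


def variance(values, probs):
    """V_x = sum_i [x_i - E(x_i)]^2 f(x_i)."""
    mean = expectation(values, probs)
    return expectation(values, probs, lambda x: (x - mean) ** 2)


# Hypothetical example: a fair six-sided die.
faces = [1, 2, 3, 4, 5, 6]
pmf = [Fraction(1, 6)] * 6

mean = expectation(faces, pmf)   # 7/2
var = variance(faces, pmf)       # 35/12

# Properties 1 and 2 of the expectation operator, combined:
# E{3x + 2} = 3 E{x} + 2.
lhs = expectation(faces, pmf, lambda x: 3 * x + 2)
rhs = 3 * mean + 2
```

Here `lhs` and `rhs` agree, which illustrates the linearity properties listed for the expectation operator.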

Unbiased Estimate: $\hat{x}$ is said to be the unbiased estimate of $x$, if it
satisfies the relation

$$E\{x - \hat{x}\} = 0.$$

Correlated noise: Two random white-noise processes $v(t)$ and $w(t)$ are
said to be correlated if they satisfy the relation

$$E\{v(t)\, w^T(s)\} = \Psi_{vw}\, \delta(t-s)$$

for all $t$ and $s$, where $T$ denotes the matrix transpose, $\Psi_{vw}$ is not
equal to zero, and $\delta$ is the Dirac delta function.

Colored noise: A colored noise process is a time-correlated process,
that is, it exhibits correlation between different instants of time.
If $v(t)$ is a colored noise process, then

$$E\{v(t)\, v^T(\tau)\} \neq 0 \quad \text{for } t \neq \tau.$$

In general, one tends to describe the colored noise by a shaping filter
[42] given by the equation

$$\dot{v}(t) = A(t)\, v(t) + w(t)$$

where $A(t)$ and the white noise process $w(t)$ must be chosen so that $v(t)$
has the prescribed statistics.
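
As an illustration of the shaping filter, consider the scalar, time-invariant case $A(t) = -a$ with $a > 0$, driven by white noise of intensity $q$. These choices, and the names `stationary_variance` and `autocovariance`, are assumptions made for this sketch only. Under them, the variance $P(t)$ of $v(t)$ obeys $\dot{P} = -2aP + q$, with steady state $q/(2a)$, and the stationary autocovariance is $(q/2a)\, e^{-a|t-\tau|}$, which is nonzero for $t \neq \tau$.

```python
import math


def stationary_variance(a, q, dt=1e-3, t_end=20.0):
    """Euler integration of dP/dt = -2 a P + q from P(0) = 0.

    P(t) is the variance of v(t) for dv/dt = -a v + w, where w is white
    noise of intensity q. The result approaches q / (2 a).
    """
    p = 0.0
    for _ in range(int(round(t_end / dt))):
        p += dt * (-2.0 * a * p + q)
    return p


def autocovariance(a, q, lag):
    """Stationary E{v(t + lag) v(t)} for the scalar shaping filter."""
    return (q / (2.0 * a)) * math.exp(-a * abs(lag))


# Hypothetical parameters chosen for illustration.
a, q = 0.5, 2.0
p_inf = stationary_variance(a, q)        # close to q / (2a) = 2.0
r_one = autocovariance(a, q, lag=1.0)    # positive: v is time-correlated
```

Choosing $a$ and $q$ then amounts to matching a prescribed correlation time $1/a$ and variance $q/(2a)$.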

Linear estimation problems: Assume that the measurement process of a
dynamical system is given by

$$z(t) = H(t)\, x(t) + v(t)$$

in the continuous-time case, and

$$z(k) = H(k)\, x(k) + v(k)$$

in the discrete-time case, where $t$ denotes the continuous time with
$t \geq t_0$, $t_0$ is fixed; and $k = 0, 1, \ldots$ is the discrete time index. Here,
$z$ and $v$ are the m-vector measurement and measurement error vectors, respectively.

Any estimate of $x(k)$, $k = 0, 1, \ldots$, which is based on the
sequence of measurements

$$\{z(0), z(1), \ldots, z(j)\},$$

$j \geq 0$, is denoted by $\hat{x}(k/j)$. Similarly, in the continuous-time case,
$\hat{x}(t/s)$ is the estimate of $x(t)$ derived from the measurement set
$\{z(\tau),\ t_0 \leq \tau \leq s\}$. These estimates are classified [3] as follows,
according to the value of $k$ or $t$ relative to that of $j$ or $s$, respectively:

                      Discrete Process    Continuous Process

1.  Prediction :          k > j                t > s

2.  Filtering  :          k = j                t = s

3.  Smoothing  :          k < j                t < s

In  general,  the  state  estimation  problem  is  to  obtain 
an  estimate  of  the  state  x(k)  or  x(t),  such  that  it  is  "best"  or 
optimal  according  to  some  estimation  error  functional. 

Smoothing: The smoothed estimates can be divided into three separate
classes [43]:

1. Fixed-interval smoothed estimate $\hat{x}(k/N)$, $k = 0, 1, \ldots, N-1$, $N$ = fixed
positive integer.

2. Fixed-point smoothed estimate $\hat{x}(k/j)$, $j = k+1, k+2, \ldots$, $k$ = fixed
integer.

3. Fixed-lag smoothed estimate $\hat{x}(k/k+N)$, $k = 0, 1, \ldots$, $N$ = fixed
positive integer.

A similar classification can be made in the continuous-time case.
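
The index conventions for the discrete-time estimate of $x(k)$ based on measurements up to $z(j)$ can be made concrete with a short sketch. The function names below are illustrative only; each helper returns the first few $(k, j)$ index pairs of one smoothing class, and `classify_estimate` applies the prediction, filtering and smoothing rule given earlier in this section.

```python
def classify_estimate(k, j):
    """Classify the estimate of x(k) based on measurements z(0), ..., z(j)."""
    if k > j:
        return "prediction"
    if k == j:
        return "filtering"
    return "smoothing"


def fixed_interval_pairs(N):
    """(k, N) for k = 0, 1, ..., N-1, with N fixed."""
    return [(k, N) for k in range(N)]


def fixed_point_pairs(k, count):
    """(k, j) for j = k+1, k+2, ..., with k fixed (first `count` pairs)."""
    return [(k, j) for j in range(k + 1, k + 1 + count)]


def fixed_lag_pairs(N, count):
    """(k, k+N) for k = 0, 1, ..., with the lag N fixed (first `count` pairs)."""
    return [(k, k + N) for k in range(count)]


interval = fixed_interval_pairs(3)    # [(0, 3), (1, 3), (2, 3)]
point = fixed_point_pairs(2, 3)       # [(2, 3), (2, 4), (2, 5)]
lag = fixed_lag_pairs(2, 3)           # [(0, 2), (1, 3), (2, 4)]
```

Every pair produced by the three helpers has $k < j$, so each is classified as a smoothing estimate.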

2.2 THE KALMAN FILTER

Kalman and Bucy [3,4] considered the minimum mean-square
filtering problem for linear time-varying systems driven by a white
noise process. The message model and the measurement model are
given by

$$\dot{x}(t) = F(t)\, x(t) + G(t)\, w(t)$$

and

$$z(t) = H(t)\, x(t) + v(t)$$

respectively, where $w(t)$ and $v(t)$ are independent, zero-mean Gaussian
white noise processes with positive definite variances $\Psi_w(t)$ and
$\Psi_v(t)$, respectively.

Their  block  diagrams  are  presented  in  Figure  2.1. 

The optimum linear filter is obtained in the form of an
algorithm suitable for direct computer evaluation:

$$\dot{\hat{x}}(t) = F(t)\, \hat{x}(t) + K(t)\, [z(t) - H(t)\, \hat{x}(t)]$$

$$\dot{V}_x(t) = F(t)\, V_x(t) + V_x(t)\, F^T(t) - K(t)\, \Psi_v(t)\, K^T(t) + G(t)\, \Psi_w(t)\, G^T(t)$$


Figure 2.1  Block diagram of the message and measurement models

where $K(t) = V_x(t)\, H^T(t)\, \Psi_v^{-1}(t)$ and $\hat{x}(t_0) = 0$.

The  block  diagram  for  the  above  optimum  linear  filter  is 
presented  in  Figure  2.2. 

Because  of  its  practicality  in  solving  the  estimation  problem, 
the  Kalman  filter  is  immensely  popular  in  aerospace  applications,  such 
as  in  navigation  and  guidance  control,  and  also  in  industrial  process 
control  applications. 
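
To illustrate how the variance equation of the Kalman filter can be evaluated numerically, the following minimal sketch integrates its scalar, time-invariant form by Euler steps. The scalar parameters `f`, `g`, `h`, `q`, `r` (standing for F, G, H and the two noise variances), their numerical values, and the function names are assumptions made for this example. In the scalar case the equation reduces to dV/dt = 2fV - K r K + g q g with K = V h / r, whose steady state has the closed form (r/h^2)(f + sqrt(f^2 + h^2 g^2 q / r)).

```python
import math


def variance_rate(v, f, g, h, q, r):
    """Right-hand side of the scalar variance equation and the gain K."""
    k = v * h / r                       # K = V H^T (noise variance)^-1
    dv = 2.0 * f * v - k * r * k + g * q * g
    return dv, k


def integrate_variance(v0, f, g, h, q, r, dt=1e-3, t_end=10.0):
    """Euler integration of the variance equation from V(t0) = v0."""
    v = v0
    for _ in range(int(round(t_end / dt))):
        dv, _ = variance_rate(v, f, g, h, q, r)
        v += dt * dv
    return v


# Hypothetical scalar system chosen for illustration.
f, g, h, q, r = -1.0, 1.0, 1.0, 3.0, 1.0
v_final = integrate_variance(v0=5.0, f=f, g=g, h=h, q=q, r=r)
v_closed = (r / h ** 2) * (f + math.sqrt(f ** 2 + h ** 2 * g ** 2 * q / r))
k_final = v_final * h / r               # steady-state gain
```

For these values both `v_final` and `v_closed` are close to 1, so the steady-state gain is close to 1 as well. The variance equation does not depend on the measurements, which is why the gain can be computed before any data are processed.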

2.3 GRADIENT MATRIX

In  this  section,  the  basic  concept  of  a  gradient  matrix  is 
defined  and  some  identity  equations  are  presented.  The  basic  idea  and 
more  complete  treatment  of  this  topic  can  be  found  in  Athans  and 
Schweppe  [44]. 

Let $X$ be an $n \times n$ matrix with elements $x_{ij}$. Let $f(\cdot)$ be a
scalar-valued function of the $x_{ij}$:

$$f(X) = f(x_{11}, x_{12}, \ldots, x_{1n}, x_{21}, \ldots, x_{nn})$$

As an example, the trace of the matrix $X$ is a scalar-valued
function

$$f(X) = \text{trace}[X] = x_{11} + x_{22} + \cdots + x_{nn}.$$


Figure 2.2  Block diagram of the Kalman filter

The gradient matrix of $f(X)$ with respect to the matrix $X$ is
defined to be the $n \times n$ matrix whose $ij$th element is the following partial
derivative

$$\frac{\partial f(X)}{\partial x_{ij}}, \quad i, j = 1, 2, \ldots, n.$$

The gradient matrix is denoted by

$$\frac{\partial f(X)}{\partial X}$$


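and the gradient of the trace of $X$ satisfies the following relations:

$$\begin{aligned}
\frac{\partial}{\partial X}\, \text{trace}[X] &= I \\
\frac{\partial}{\partial X}\, \text{trace}[AX] &= A^T \\
\frac{\partial}{\partial X}\, \text{trace}[AX^T] &= A \\
\frac{\partial}{\partial X}\, \text{trace}[AXB] &= A^T B^T \\
\frac{\partial}{\partial X}\, \text{trace}[AX^T B] &= BA \\
\frac{\partial}{\partial X}\, \text{trace}[AXBX] &= A^T X^T B^T + B^T X^T A^T \\
\frac{\partial}{\partial X}\, \text{trace}[AXBX^T] &= A^T X B^T + AXB
\end{aligned}$$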

Additional  relations  can  be  obtained  through  the  use  of  the 
following  trace  identities 

$$\text{trace}[AB] = \text{trace}[BA]$$

$$\text{trace}[AB^T] = \text{trace}[BA^T].$$
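
The gradient relations of this section can be checked numerically by comparing each closed form with a central-difference approximation of the partial derivatives with respect to the elements $x_{ij}$. The sketch below does this for the gradient of trace[AXBX^T]; the 2 x 2 test matrices and all helper names are arbitrary choices made for illustration, and plain Python lists are used to keep the example self-contained.

```python
def matmul(A, B):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)]
            for row in A]


def transpose(A):
    return [list(col) for col in zip(*A)]


def add(A, B):
    return [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]


def trace(A):
    return sum(A[i][i] for i in range(len(A)))


def numerical_gradient(func, X, eps=1e-6):
    """Matrix whose ij-th element approximates d func / d x_ij."""
    n = len(X)
    grad = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            plus = [row[:] for row in X]
            minus = [row[:] for row in X]
            plus[i][j] += eps
            minus[i][j] -= eps
            grad[i][j] = (func(plus) - func(minus)) / (2.0 * eps)
    return grad


def max_abs_diff(P, Q):
    return max(abs(p - q) for rp, rq in zip(P, Q) for p, q in zip(rp, rq))


# Hypothetical test matrices.
A = [[1.0, 2.0], [3.0, 4.0]]
B = [[0.0, 1.0], [2.0, -1.0]]
X = [[1.0, -1.0], [2.0, 0.5]]


def cost(Y):
    """trace[A Y B Y^T]"""
    return trace(matmul(matmul(matmul(A, Y), B), transpose(Y)))


numeric = numerical_gradient(cost, X)
analytic = add(matmul(matmul(transpose(A), X), transpose(B)),
               matmul(matmul(A, X), B))           # A^T X B^T + A X B
error = max_abs_diff(numeric, analytic)
```

The value of `error` is at the level of floating-point rounding, since the function is quadratic in the elements of X and the central difference is then exact apart from rounding.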

2.4 MATRIX MINIMUM PRINCIPLE

2.4.1 Continuous-Time Matrix Minimum Principle

Consider a system with "state matrix" $X(t)$ and "control matrix"
$U(t)$, whose elements are, respectively, $x_{ij}$ and $u_{\alpha\beta}$. Assume that
$U(t) \in \Omega$ and that the system is described by the matrix differential equation

$$\dot{X}(t) = F[X(t), U(t), t];$$

and the cost functional is defined as

$$J = K[X(T), T] - K[X(t_0), t_0] + \int_{t_0}^{T} L[X(t), U(t), t]\, dt;$$

$t_0$ and $T$ are fixed,

where $L[\cdot]$ and $K[\cdot]$ are scalar-valued functions of their arguments
satisfying the usual differentiability conditions.
Let $P(t)$ denote the costate matrix. Define the scalar
Hamiltonian function $H$ by

$$H[X(t), P(t), t, U(t)] = L[X(t), U(t), t] + \text{trace}\{F[X(t), U(t), t]\, P^T(t)\}$$

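If * is used to indicate optimal quantities, then there
exists a costate matrix $P^*(t)$ such that the following necessary
conditions are satisfied:

(1) First Necessary Conditions

$$\dot{X}^*(t) = \left.\frac{\partial H}{\partial P(t)}\right|_* = F[X^*(t), U^*(t), t]$$

$$\dot{P}^*(t) = -\left.\frac{\partial H}{\partial X(t)}\right|_*$$

$$\left.\frac{\partial H}{\partial U(t)}\right|_* = 0$$

(2) Boundary Conditions

At the initial time

$$P^*(t_0) = \frac{\partial K[X^*(t_0), t_0]}{\partial X^*(t_0)}$$

At the terminal time

$$P^*(T) = \frac{\partial K[X^*(T), T]}{\partial X^*(T)}$$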

2.4.2  Discrete-Time  Matrix  Minimum  Principle 


Consider the discrete system defined by the equation

$$X(k+1) = F[X(k), U(k), k], \quad k = 0, 1, \ldots, N-1$$

with scalar cost functional defined by

$$J = K[X(N), N] - K[X(0), 0] + \sum_{k=0}^{N-1} L[X(k), U(k), k]$$

Then the Hamiltonian function can be defined by the relation

$$H = L[X(k), U(k), k] + \text{trace}\{F[X(k), U(k), k]\, P^T(k+1)\}$$

where $P(k)$ is the costate matrix.

It is assumed that $F[\cdot]$, $K[\cdot]$ and $L[\cdot]$ satisfy the conditions
required by the discrete minimum principle [45].

Then the discrete matrix minimum principle states that there
exists a costate matrix $P^*(k)$, $k = 0, 1, \ldots, N$, such that the following
equations are satisfied

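(1) First Necessary Conditions

$$X^*(k+1) = \left.\frac{\partial H}{\partial P(k+1)}\right|_*$$

$$P^*(k) = \left.\frac{\partial H}{\partial X(k)}\right|_*$$

$$\left.\frac{\partial H}{\partial U(k)}\right|_* = 0$$

(2) Boundary Conditions

When $k = 0$, $P^*(0) = \dfrac{\partial K[X^*(0), 0]}{\partial X^*(0)}$

When $k = N$, $P^*(N) = \dfrac{\partial K[X^*(N), N]}{\partial X^*(N)}$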


CHAPTER  III 


EXACT  AND  APPROXIMATE  MINIMUM  VARIANCE 
FILTERING  FOR  NONLINEAR  CONTINUOUS  SYSTEMS 

3.1  INTRODUCTION 

This  chapter  is  devoted  to  the  estimation  problems  of 
nonlinear  continuous  systems  without  delays  corrupted  by  white  Gaussian 
noise  as  well  as  non-white  noise  processes. 

Research in the area of nonlinear filtering has been
mushrooming during the past few years, as evidenced by the large number
of publications and reports on this topic. In general, there are
two distinct methods of approach. One approach extends the Kalman-Bucy
filters to nonlinear dynamic systems using the so-called
first-order, quasilinear, or extended Kalman filter [11, 23, 25,
etc.]. The other method, developed by Kushner [22], is based
upon the determination of the exact equations satisfied by the
conditional probability density functions and conditional expectations.
This approach uses the stochastic Ito calculus [46]. The results
obtained are optimal; however, they cannot be realized by
finite-dimensional systems. In order that the solutions be physically
realizable, the exact equations are approximated to derive suboptimal
finite-dimensional filters [17, 18, 20, 22, 26, 27, etc.]. However, in spite
of the abundance of papers on optimal and suboptimal nonlinear filters
for systems corrupted by white noise processes, to the author's knowledge
optimal nonlinear filtering algorithms for non-white noise processes
have not been derived in the literature.

Therefore, this chapter presents a modified approach to
solving nonlinear state estimation problems involving (1) additive white
noise, (2) correlated noise, and (3) noise-free processes. The basic
approach makes use of the matrix minimum principle to minimize the
error-variance cost function, which is obtained from the exact
conditional probability density functions given by the Kushner
and Kolmogorov equations. Therefore, it is not unexpected that the
exact nonlinear filtering equations obtained in this thesis for white
noise problems closely resemble those derived by Bass et al. [26].
The only difference is that the argument of the expectations has been
transformed from $x(t)$ into its estimator error $\tilde{x}(t)$. As a consequence,
for filtering problems with polynomial, product-type or piecewise
continuous nonlinearities, the proposed algorithms can be evaluated
without any other approximations under the assumption that the conditional
probability density functions of the estimator errors are Gaussian.

Here, the estimation algorithms derived for non-white noise
processes are suboptimal, since the change in the probability density
function due to the differential measurements $\delta Z$ is not known and is
neglected in our derivation. To the author's knowledge, the filtering
algorithms presented for non-white noise processes are new; they are
reported in [47, 48]. In the particular case that the dynamical systems
are linear, the resulting algorithms are optimal and agree well with
those of the literature [6, 49].


Since there is a lack of published papers that give clear
comparisons between the performance of various approximate nonlinear
filters, and in order to test the performance of the proposed minimum
variance filters, extensive simulation results accompanied by
discussions are presented to compare the performance characteristics
of these filters.

3.2  OPTIMAL  MINIMUM  VARIANCE  CONTINUOUS  NONLINEAR  FILTERING  WITH 
WHITE  NOISE  PROCESSES 

Consider the class of nonlinear systems described by the
stochastic differential equation [46]

$$\frac{dx(t)}{dt} = f[x(t), t] + G[x(t), t]\, w(t) \qquad (3.1)$$

with the measurement given by

$$z(t) = h[x(t), t] + v(t) \qquad (3.2)$$

where $x(t)$ and $z(t)$ are the n-dimensional state and m-dimensional
measurement vectors, $f$ and $h$ are, respectively, n- and m-dimensional
nonlinear vector-valued functions, and $G$ is a matrix-valued function.

The random vectors $w(t)$ and $v(t)$ are statistically
independent zero-mean white Gaussian noise processes such that for all
$t$ and $\tau$

$$ \begin{aligned}
\mathrm{Cov}\{w(t), w(\tau)\} &= \Psi_w(t)\, \delta(t - \tau) \\
\mathrm{Cov}\{v(t), v(\tau)\} &= \Psi_v(t)\, \delta(t - \tau) \\
\mathrm{Cov}\{w(t), v(\tau)\} &= 0
\end{aligned} \tag{3.3} $$

where the variances $\Psi_w(t)$ and $\Psi_v(t)$ are non-negative definite
and positive definite, respectively.

The initial state vector $x(t_0) = x_0$ is a zero-mean Gaussian
random vector, independent of $w(t)$ and $v(t)$ for $t \ge t_0$, with a
positive definite variance matrix

$$ \mathrm{Var}\{x(t_0), x(t_0)\} = V_x(t_0) $$

In the typical filtering problem, it is required to compute
$\hat{x}(t)$, the unbiased estimate of $x(t)$ conditioned on the set of
measurements

$$ Z(t) = \{ z(s) \mid t_0 \le s \le t \}, $$

such that the cost function for $\tau > t_0$

$$ J(\tau) = E\{ [x(\tau) - \hat{x}(\tau)]^T M(\tau) [x(\tau) - \hat{x}(\tau)] \} \tag{3.4} $$

is minimized. Here $M(\tau)$ is an arbitrary symmetric positive definite
matrix.


The filtering algorithm is assumed to satisfy the general
nonlinear differential equation

$$ \dot{\hat{x}}(t) = \ell[\hat{x}(t), t] + K(t)\, z(t) \tag{3.5} $$

where $\ell[\hat{x}(t), t]$ and $K(t)$ are yet unknown. Then the estimation
problem is to determine the time varying nonlinear vector valued function
$\ell[\hat{x}(t), t]$ and the gain algorithm $K(t)$ such that the cost
function of Equation (3.4) is minimized.

In fact, the nonlinear filtering equation may be assumed to
take various forms; however, once enough information concerning
$\hat{x}(t)$ and $z(t)$ is included in the estimator model, the resulting
filtering algorithm is unique. For example, other dynamic filtering
equations such as

$$ \dot{\hat{x}}(t) = \ell[\hat{x}(t), t] + K(t)\, \{ z(t) - E[z(t)] \} \tag{3.6} $$

can also be assumed. Comparing the structures of Equations (3.5) and
(3.6), it is obvious that an extra term $-K(t)\, E[z(t)]$ is included in
Equation (3.6); however, either one of the above estimator models
results in exactly the same nonlinear filtering algorithm.

Now, let $\tilde{x}(t)$ denote the estimator error defined by

$$ \tilde{x}(t) = x(t) - \hat{x}(t) \tag{3.7} $$

Using Equations (3.1), (3.2) and (3.5), the derivative of Equation
(3.7) becomes

$$ \begin{aligned}
\dot{\tilde{x}}(t) ={}& f[\tilde{x}(t) + \hat{x}(t), t] + G[\tilde{x}(t) + \hat{x}(t), t]\, w(t) \\
& - \ell[\hat{x}(t), t] - K(t)\, \{ h[\tilde{x}(t) + \hat{x}(t), t] + v(t) \}
\end{aligned} \tag{3.8} $$

Since $\hat{x}(t)$ is required to be an unbiased estimate, the
expectations of both $\tilde{x}(t)$ and $\dot{\tilde{x}}(t)$ must be zero.
Hence, if the expectations of both sides of Equation (3.8) are taken, it is
necessary that

$$ \ell[\hat{x}(t), t] = \hat{f}[\tilde{x}(t) + \hat{x}(t), t] - K(t)\, \hat{h}[\tilde{x}(t) + \hat{x}(t), t] \tag{3.9} $$

where

$$ \hat{f}[\tilde{x}(t) + \hat{x}(t), t] = E\{ f[\tilde{x}(t) + \hat{x}(t), t] \mid Z(t) \} $$

$$ \hat{h}[\tilde{x}(t) + \hat{x}(t), t] = E\{ h[\tilde{x}(t) + \hat{x}(t), t] \mid Z(t) \} $$

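Then the estimator error can be shown to satisfy the relation

$$ \dot{\tilde{x}}(t) = f^*[\tilde{x}(t), \hat{x}(t), t] + G^*[\tilde{x}(t), \hat{x}(t), t]\, w^*(t) \tag{3.10} $$

where

$$ \begin{aligned}
f^*[\tilde{x}(t), \hat{x}(t), t] ={}& f[\tilde{x}(t) + \hat{x}(t), t] - \hat{f}[\tilde{x}(t) + \hat{x}(t), t] \\
& + K(t)\, \{ \hat{h}[\tilde{x}(t) + \hat{x}(t), t] - h[\tilde{x}(t) + \hat{x}(t), t] \}
\end{aligned} $$

$$ G^*[\tilde{x}(t), \hat{x}(t), t] = \big[ G[\tilde{x}(t) + \hat{x}(t), t], \; -K(t) \big] $$

and

$$ w^*(t) = \begin{bmatrix} w(t) \\ v(t) \end{bmatrix} $$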

Furthermore, Equation (3.2) can be rewritten as

$$ z(t) = h[\tilde{x}(t) + \hat{x}(t), t] + v(t) \tag{3.11} $$

Now the filtering problem of Equations (3.1) and (3.2) has been
transformed into that of Equations (3.10) and (3.11).

Let $\phi[\tilde{x}(t)]$ be a twice continuously differentiable function
of the vector $\tilde{x}(t)$; by definition of the conditional expectation
operator

$$ dE\{ \phi[\tilde{x}(t)] \} = \int \phi[\tilde{x}(t)]\, dp[\tilde{x}(t), t \mid Z(t)]\, d\tilde{x}(t) \tag{3.12} $$

where $p[\tilde{x}(t), t \mid Z(t)]$ is the conditional probability density
function.

Next, the change in $p[\tilde{x}(t), t \mid Z(t)]$ due to the dynamic
equations of (3.10) and (3.11) must be computed. It can be shown that
[46]

$$ \begin{aligned}
\delta p ={}& p[\tilde{x}(t + \delta t), t + \delta t \mid Z(t), \delta z] - p[\tilde{x}(t), t \mid Z(t)] \\
={}& p[\tilde{x}(t + \delta t), t + \delta t \mid Z(t), \delta z] - p[\tilde{x}(t), t \mid Z(t), \delta z] \\
& + p[\tilde{x}(t), t \mid Z(t), \delta z] - p[\tilde{x}(t), t \mid Z(t)]
\end{aligned} \tag{3.13} $$

where the first two terms are simply the change due to the dynamic
equation of (3.10) and the last two terms are due to the differential
measurements $\delta z$. These two changes are given by Kolmogorov's and
Kushner's equations [20, 22]. Therefore,

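$$ \begin{aligned}
\frac{dE\{ \phi[\tilde{x}(t)] \}}{dt} ={}& E\{ L \phi[\tilde{x}(t)] \} + E\{ \phi[\tilde{x}(t)]\, \{ h[\tilde{x}(t) + \hat{x}(t), t] - \hat{h}[\tilde{x}(t) + \hat{x}(t), t] \}^T \} \\
& \cdot \Psi_v^{-1}(t)\, \{ z(t) - \hat{h}[\tilde{x}(t) + \hat{x}(t), t] \}
\end{aligned} \tag{3.14} $$

where

$$ \begin{aligned}
L \phi[\tilde{x}(t)] ={}& \sum_{i=1}^{n} f_i^*[\tilde{x}(t), \hat{x}(t), t]\, \frac{\partial \phi[\tilde{x}(t)]}{\partial \tilde{x}_i} \\
& + \frac{1}{2} \sum_{i,j=1}^{n} \{ G^*[\tilde{x}(t), \hat{x}(t), t]\, \Psi_{w^*}(t)\, G^{*T}[\tilde{x}(t), \hat{x}(t), t] \}_{ij}\, \frac{\partial^2 \phi[\tilde{x}(t)]}{\partial \tilde{x}_i\, \partial \tilde{x}_j}
\end{aligned} \tag{3.15} $$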


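Then the error-variance equation is obtained from Equations
(3.10), (3.11), (3.14) and (3.15) by setting
$\phi[\tilde{x}(t)] = \tilde{x}(t)\, \tilde{x}^T(t)$, for which

$$ \begin{aligned}
\frac{dV_{\tilde{x}}(t)}{dt} ={}& E\{ \tilde{x}(t)\, f^{*T}[\tilde{x}(t), \hat{x}(t), t] + f^*[\tilde{x}(t), \hat{x}(t), t]\, \tilde{x}^T(t) \} \\
& + E\{ G[\tilde{x}(t) + \hat{x}(t), t]\, \Psi_w(t)\, G^T[\tilde{x}(t) + \hat{x}(t), t] + K(t)\, \Psi_v(t)\, K^T(t) \} \\
& + E\{ \tilde{x}(t)\, \tilde{x}^T(t)\, h^T[\tilde{x}(t) + \hat{x}(t), t] - V_{\tilde{x}}(t)\, \hat{h}^T[\tilde{x}(t) + \hat{x}(t), t] \} \\
& \cdot \Psi_v^{-1}(t)\, \{ z(t) - \hat{h}[\tilde{x}(t) + \hat{x}(t), t] \}
\end{aligned} \tag{3.16} $$

where it can be shown that

$$ E\{ \tilde{x}(t)\, f^{*T}[\tilde{x}(t), \hat{x}(t), t] \} = E\{ \tilde{x}(t)\, f^T[\tilde{x}(t) + \hat{x}(t), t] - \tilde{x}(t)\, h^T[\tilde{x}(t) + \hat{x}(t), t]\, K^T(t) \} $$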

On the other hand, the initial condition is given by

$$ V_{\tilde{x}}(t_0) = E\{ [x(t_0) - \hat{x}(t_0)] [x(t_0) - \hat{x}(t_0)]^T \} = V_x(t_0) $$

Now the estimation problem takes the form of an optimal control
problem in which $K(t)$ is the only variable available for manipulation
such that the cost function of Equation (3.4) is minimized.

However, Equation (3.4) can be expressed in a more convenient
form by observing that for any two column vectors $x$ and $y$, the
following identity holds

$$ x^T y = \mathrm{trace}[y\, x^T] $$

It follows that

$$ \begin{aligned}
J(\tau) &= E\{ \mathrm{trace}[M(\tau)\, \tilde{x}(\tau)\, \tilde{x}^T(\tau)] \} \\
&= \mathrm{trace}\{ M(\tau)\, E[\tilde{x}(\tau)\, \tilde{x}^T(\tau)] \} \\
&= \mathrm{trace}\{ M(\tau)\, V_{\tilde{x}}(\tau) \}
\end{aligned} $$
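The trace form of the cost can be checked numerically. The following is only a minimal sketch: the weighting matrix, the error samples and the function names are hypothetical and are chosen for illustration, not taken from the thesis.

```python
import numpy as np

def quadratic_cost(err, M):
    """Cost err^T M err for a single error sample."""
    return float(err @ M @ err)

def trace_cost(err, M):
    """The same cost written as trace(M err err^T)."""
    return float(np.trace(M @ np.outer(err, err)))

rng = np.random.default_rng(0)
A = rng.standard_normal((3, 3))
M = A @ A.T + 3.0 * np.eye(3)          # hypothetical symmetric positive definite weight

# Single sample: both forms of the cost coincide.
err = rng.standard_normal(3)
single_gap = abs(quadratic_cost(err, M) - trace_cost(err, M))

# Sample average: mean of err^T M err equals trace(M V) with V the sample
# second-moment matrix, which mirrors J = trace{M V} in the text.
errs = rng.standard_normal((1000, 3))
V_sample = errs.T @ errs / errs.shape[0]
mean_cost = float(np.mean([quadratic_cost(e, M) for e in errs]))
average_gap = abs(mean_cost - float(np.trace(M @ V_sample)))
```

Because the expectation and the trace are both linear, the agreement holds exactly (up to rounding) for any sample set, which is why the cost depends on the estimator only through the error-variance matrix.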

To use the matrix minimum principle to solve this problem,
a costate matrix $P(t)$ is defined, and the Hamiltonian is given by

$$ H = \mathrm{trace}\{ \dot{V}_{\tilde{x}}(t)\, P^T(t) \} $$

Using the concept of gradient matrices [44], the necessary
condition which the optimal $K(t)$ must satisfy is obtained as

$$ K(t) = E\{ \tilde{x}(t)\, h^T[\tilde{x}(t) + \hat{x}(t), t] \}\, \Psi_v^{-1}(t) \tag{3.17} $$

Note that $K(t)$ is independent of the costate matrix $P(t)$ and the
weighting factor $M(\tau)$.

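Then the filtering estimate and the error-variance equation
become

$$ \begin{aligned}
\dot{\hat{x}}(t) ={}& \hat{f}[\tilde{x}(t) + \hat{x}(t), t] + E\{ \tilde{x}(t)\, h^T[\tilde{x}(t) + \hat{x}(t), t] \} \\
& \cdot \Psi_v^{-1}(t)\, \{ z(t) - \hat{h}[\tilde{x}(t) + \hat{x}(t), t] \}
\end{aligned} \tag{3.18} $$

and

$$ \begin{aligned}
\frac{dV_{\tilde{x}}(t)}{dt} ={}& E\{ \tilde{x}(t)\, f^T[\tilde{x}(t) + \hat{x}(t), t] + f[\tilde{x}(t) + \hat{x}(t), t]\, \tilde{x}^T(t) \} \\
& + E\{ G[\tilde{x}(t) + \hat{x}(t), t]\, \Psi_w(t)\, G^T[\tilde{x}(t) + \hat{x}(t), t] \} \\
& - E\{ \tilde{x}(t)\, h^T[\tilde{x}(t) + \hat{x}(t), t] \}\, \Psi_v^{-1}(t)\, E\{ h[\tilde{x}(t) + \hat{x}(t), t]\, \tilde{x}^T(t) \} \\
& + E\{ \tilde{x}(t)\, \tilde{x}^T(t)\, h^T[\tilde{x}(t) + \hat{x}(t), t] - V_{\tilde{x}}(t)\, \hat{h}^T[\tilde{x}(t) + \hat{x}(t), t] \} \\
& \cdot \Psi_v^{-1}(t)\, \{ z(t) - \hat{h}[\tilde{x}(t) + \hat{x}(t), t] \}
\end{aligned} \tag{3.19} $$

respectively.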

It should be noted that before the above algorithms can be
physically realized, a number of difficult expectations must be evaluated.
They involve infinite dimensional systems except in trivially simple
cases. Also notice that the presented algorithms closely resemble
the exact equations due to Bass et al. [26]. The only difference is
that the argument of the expectations has been transformed from $x(t)$
into the estimator error $\tilde{x}(t)$. Such a transformation is
particularly significant for filtering problems with polynomial,
product-type or piecewise continuous nonlinearities. In such cases, the
filtering algorithms can be obtained without any further approximations,
under the assumption that the probability density functions of the
estimator errors are Gaussian.

For example, consider a nonlinear system with message and
measurement models described by

$$ \dot{x}(t) = -\sin x(t) + u(t) $$

and

$$ y(t) = x^3(t) + v(t) $$

respectively, where $u(t)$ and $v(t)$ are zero-mean white Gaussian noise
processes with variances $\Psi_u(t)$ and $\Psi_v(t)$, respectively.

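Assuming that $\tilde{x}(t)$ is a Gaussian process, the expectations
of the nonlinear functions can be evaluated as follows

$$ E[x^3(t)] = E\{ [\tilde{x}(t) + \hat{x}(t)]^3 \} = 3 V_{\tilde{x}}(t)\, \hat{x}(t) + \hat{x}^3(t) $$

$$ E[\sin x(t)] = E\{ \sin[\tilde{x}(t) + \hat{x}(t)] \} = \sin \hat{x}(t) \cdot e^{-V_{\tilde{x}}(t)/2} $$

$$ E[\tilde{x}(t) \sin x(t)] = \cos \hat{x}(t)\, V_{\tilde{x}}(t)\, e^{-V_{\tilde{x}}(t)/2} $$

$$ E[\tilde{x}^2(t)\, x^3(t)] = 9 V_{\tilde{x}}^2(t)\, \hat{x}(t) + V_{\tilde{x}}(t)\, \hat{x}^3(t) $$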

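Hence, Equations (3.18) and (3.19) become

$$ \begin{aligned}
\dot{\hat{x}}(t) ={}& -\sin \hat{x}(t) \cdot e^{-V_{\tilde{x}}(t)/2} + [3 V_{\tilde{x}}^2(t) + 3 V_{\tilde{x}}(t)\, \hat{x}^2(t)]\, \Psi_v^{-1}(t) \\
& \cdot [y(t) - 3 V_{\tilde{x}}(t)\, \hat{x}(t) - \hat{x}^3(t)]
\end{aligned} $$

and

$$ \begin{aligned}
\dot{V}_{\tilde{x}}(t) ={}& -2 V_{\tilde{x}}(t)\, e^{-V_{\tilde{x}}(t)/2} \cos \hat{x}(t) + \Psi_u(t) \\
& - [3 V_{\tilde{x}}^2(t) + 3 V_{\tilde{x}}(t)\, \hat{x}^2(t)]^2\, \Psi_v^{-1}(t) \\
& + [6 V_{\tilde{x}}^2(t)\, \hat{x}(t)]\, \Psi_v^{-1}(t)\, [y(t) - 3 V_{\tilde{x}}(t)\, \hat{x}(t) - \hat{x}^3(t)]
\end{aligned} $$

respectively.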
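For concreteness, the right-hand sides of the two scalar equations of this example can be coded directly. This is a minimal sketch under the stated Gaussian assumption; the function names, the `forcing` switch and the plain Euler step are illustrative choices and not the CSMP implementation used in the thesis. Setting `forcing=False` drops the measurement-driven term of the variance equation while leaving the estimate equation unchanged.

```python
import math

def proposed_filter_rhs(xhat, V, y, psi_u, psi_v, forcing=True):
    """Right-hand sides (dxhat/dt, dV/dt) for x' = -sin x + u, y = x^3 + v."""
    decay = math.exp(-V / 2.0)
    gain = (3.0 * V**2 + 3.0 * V * xhat**2) / psi_v      # E{x_err h} / psi_v
    innovation = y - 3.0 * V * xhat - xhat**3            # y - E{h}
    dxhat = -math.sin(xhat) * decay + gain * innovation
    dV = -2.0 * V * decay * math.cos(xhat) + psi_u - gain**2 * psi_v
    if forcing:
        # Random forcing term: 6 V^2 xhat psi_v^{-1} (y - E{h}).
        dV += 6.0 * V**2 * xhat / psi_v * innovation
    return dxhat, dV

def euler_step(xhat, V, y, psi_u, psi_v, dt, forcing=True):
    """One explicit Euler step of the estimate and its error variance."""
    dxhat, dV = proposed_filter_rhs(xhat, V, y, psi_u, psi_v, forcing)
    return xhat + dt * dxhat, V + dt * dV

# Example evaluation at hypothetical values of the estimate and measurement.
rates_with = proposed_filter_rhs(1.0, 0.5, 2.0, 1.0, 2.0, forcing=True)
rates_without = proposed_filter_rhs(1.0, 0.5, 2.0, 1.0, 2.0, forcing=False)
```

How the white-noise measurement is sampled in a discrete simulation is a separate modelling choice that this sketch does not address.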

Note that in the evaluation of the expectations the only
assumption needed is the Gaussian assumption on the estimator error,
whereas all other finite dimensional algorithms delete some of the
higher order terms, in particular the higher order terms of the
error-variance. When the nonlinear functions in Equations (3.18) and
(3.19) are approximated by Taylor series expansions, the results obtained
can be identified with various other approximate nonlinear filtering
algorithms in the literature [23, 24, 26, 28].

Furthermore, the above derivation can be extended to filtering
problems with non-white noise processes. However, in such cases, the
change in $p[\tilde{x}(t), t \mid Z(t)]$ due to the differential
measurements $\delta z$ is not available in the literature, and to the
author's knowledge optimal nonlinear filtering algorithms for such
problems have not yet been derived.

As an approximation, the effect due to the differential
measurements $\delta z$ is neglected, and Equation (3.14) becomes simply

$$ \frac{dE\{ \phi[\tilde{x}(t)] \}}{dt} = E\{ L \phi[\tilde{x}(t)] \} \tag{3.20} $$

It is noteworthy that when such an approximation is made, the
results obtained for the filtering problem corrupted by white noise
processes are equivalent to those of the stochastic linearization due to
Sunahara [29]. In such a case the random forcing term in the
error-variance equation of Equation (3.19) is neglected, whereas the
filtering algorithm of Equation (3.18) remains unchanged.

When Equation (3.20) is used in place of Equation (3.14) to
obtain the error-variance equation, and the nonlinear functions are
approximated by second order Taylor series expansions, the resulting
algorithm is commonly called the modified minimum variance filter [28].

3.3  ILLUSTRATIVE  EXAMPLES  AND  DISCUSSIONS 

In order to test the performance of the proposed minimum
variance filter and to compare it with various other approximate nonlinear
filters [23-24, 26-28], three different types of nonlinear systems are
selected. The stochastic equations are transformed to Stratonovich's
form, and then simulated on an IBM 360 computer using CSMP. The
simulation results presented show the estimates for one simulation run
only. The integration was performed by the rectangular and the fixed-step
fourth-order Runge-Kutta methods; the step size chosen ranged from 0.001
to 0.00005.

Meanwhile, various structures of dynamic finite dimensional
filters and their error-variance equations are listed in Tables 3.1 and
3.2, respectively, where the scalar case is considered.

It can be observed that amongst the truncated, the quasi-moment
and the modified minimum variance filters, the only difference is
the random forcing term that appears in the error-variance equations.
This is also true in comparing the stochastic linearization filter and
the minimum variance filter proposed in Section 3.2.

For Figures (3.1) - (3.10), the system models corresponding
to the example of Section 3.2 were simulated. The random forcing term
for such systems is given by

$$ 6 V_{\tilde{x}}^2(t)\, \hat{x}(t)\, \Psi_v^{-1}(t)\, \{ y(t) - 3 V_{\tilde{x}}(t)\, \hat{x}(t) - \hat{x}^3(t) \} $$

Table 3.1  Dynamic Equations of Various Nonlinear Filters

System Dynamics: $\dot{x} = f(x) + g(x)\, w$
Measurement Model: $z = h(x) + v$

| Filter Nomenclature | Filter Dynamics |
|---|---|
| Modified Minimum Variance Filter | $\dot{\hat{x}} = f(\hat{x}) + \tfrac{1}{2} V_{\tilde{x}} f''(\hat{x}) + V_{\tilde{x}} h'(\hat{x}) \Psi_v^{-1} \{ z - h(\hat{x}) - \tfrac{1}{2} V_{\tilde{x}} h''(\hat{x}) \}$  (A) |
| Truncated Minimum Variance Filter | Same as Eq. (A) |
| Quasi-Moment Minimum Variance Filter | Same as Eq. (A) |
| Stochastic Linearization Filter (Sunahara) | $\dot{\hat{x}} = \hat{f}(x) + E[\tilde{x}\, h(x)]\, \Psi_v^{-1} [z - \hat{h}(x)]$  (B) |
| Proposed Minimum Variance Filter | Same as Eq. (B) |

The prime (') denotes the first derivative with respect to the argument.

Table 3.2  Error-Variance Equations of Various Nonlinear Filters

System Dynamics: $\dot{x} = f(x) + g(x)\, w$
Measurement Model: $z = h(x) + v$

| Filter Nomenclature | Error-Variance Equations |
|---|---|
| Modified Minimum Variance Filter | $\dot{V}_{\tilde{x}} = 2 V_{\tilde{x}} f'(\hat{x}) + g^2(\hat{x}) \Psi_w - V_{\tilde{x}}^2 h'^2(\hat{x}) \Psi_v^{-1}$  (C) |
| Truncated Minimum Variance Filter | $\dot{V}_{\tilde{x}} =$ right-hand side of Eq. (C) $- \tfrac{1}{2} V_{\tilde{x}}^2 h''(\hat{x}) \Psi_v^{-1} \{ z - h(\hat{x}) - \tfrac{1}{2} V_{\tilde{x}} h''(\hat{x}) \}$ |
| Quasi-Moment Minimum Variance Filter | $\dot{V}_{\tilde{x}} =$ right-hand side of Eq. (C) $+ V_{\tilde{x}}^2 h''(\hat{x}) \Psi_v^{-1} \{ z - h(\hat{x}) - \tfrac{1}{2} V_{\tilde{x}} h''(\hat{x}) \}$ |
| Stochastic Linearization Filter (Sunahara) | $\dot{V}_{\tilde{x}} = 2 E[\tilde{x} f(x)] + E[g^2(x) \Psi_w] - \{ E[\tilde{x}\, h(x)] \}^2 \Psi_v^{-1}$  (D) |
| Proposed Minimum Variance Filter | $\dot{V}_{\tilde{x}} =$ right-hand side of Eq. (D) $+ E[\tilde{x}^2 h(x) - V_{\tilde{x}} \hat{h}(x)]\, \Psi_v^{-1} [z - \hat{h}(x)]$ |

Four sets of prior statistics were assumed, and the results are
very interesting.

In Figures (3.1) and (3.2), the statistics are given by

$$ x(0) = -3.0, \quad \hat{x}(0) = 0.0, \quad V_{\tilde{x}}(0) = 2.25, \quad \Psi_u = \Psi_v = 1.0 $$

whereas in Figures (3.3) and (3.4)

$$ x(0) = 1.5, \quad \hat{x}(0) = 0.0, \quad V_{\tilde{x}}(0) = 2.25, \quad \Psi_u = \Psi_v = 0.1 $$

For these two sets of statistics, it can be shown theoretically that
the contribution due to the random forcing term is not significant at
all. Therefore, it can be expected that the results from the truncated,
the quasi-moment and the modified minimum variance filters are not
considerably different from one another, and also that the stochastic
linearization filter would not deviate from the proposed minimum variance
filter in any significant way. Both expectations are verified
experimentally and are obvious from Figures (3.1) - (3.4).

However, from Figures (3.1) - (3.4), it is obvious that the
performance characteristics of the stochastic linearization and the
proposed minimum variance filters are significantly superior to those of
all other first or second order approximate filters. This is due to the
fact that $E[\tilde{x}(t)\, h(x)]$ is significantly different from
$V_{\tilde{x}}\, h'(\hat{x})$.

Figure 3.1  The output run of the estimator error for systems
with $\hat{x}(0) = 0.0$, $x(0) = -3.0$, $\Psi_u = \Psi_v = 1.0$ and $V_{\tilde{x}}(0) = 2.25$

Figure 3.2  The output run of the error-variance for systems
with $\hat{x}(0) = 0.0$, $x(0) = -3.0$, $\Psi_u = \Psi_v = 1.0$ and $V_{\tilde{x}}(0) = 2.25$

Figure 3.3  The output run of the estimator error for systems
with $\hat{x}(0) = 0.0$, $x(0) = 1.5$, $\Psi_u = \Psi_v = 0.1$ and $V_{\tilde{x}}(0) = 2.25$

Figure 3.4  The output run of the error-variance for systems
with $\hat{x}(0) = 0.0$, $x(0) = 1.5$, $\Psi_u = \Psi_v = 0.1$ and $V_{\tilde{x}}(0) = 2.25$
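The size of this difference can be illustrated with simple arithmetic for the cubic measurement $h(x) = x^3$, for which the Gaussian assumption gives $E[\tilde{x}\, h(x)] = 3V_{\tilde{x}}^2 + 3V_{\tilde{x}}\hat{x}^2$ while $V_{\tilde{x}}\, h'(\hat{x}) = 3V_{\tilde{x}}\hat{x}^2$. The sketch below evaluates both only at the initial statistics quoted in this section; the function names are hypothetical, and the values say nothing about later times in a run.

```python
def exact_gain_moment(xhat, V):
    """E{x_err * h(x)} for h(x) = x**3 with Gaussian error of variance V."""
    return 3.0 * V**2 + 3.0 * V * xhat**2

def first_order_gain_moment(xhat, V):
    """V * h'(xhat) for h(x) = x**3, as used by the Taylor-series filters."""
    return 3.0 * V * xhat**2

# (initial estimate, initial error variance) quoted in Section 3.3
cases = {
    "Figures 3.1-3.4": (0.0, 2.25),
    "Figures 3.5-3.6": (-1.0, 0.1),
}

comparison = {
    name: (exact_gain_moment(xh, V), first_order_gain_moment(xh, V))
    for name, (xh, V) in cases.items()
}
# Figures 3.1-3.4: 15.1875 against 0.0 (large difference)
# Figures 3.5-3.6: 0.33 against 0.3 (small difference)
```

With a large initial variance and a zero initial estimate, the Taylor-series gain vanishes entirely while the exact moment does not, which is consistent with the ranking observed in Figures (3.1) - (3.4).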

Figures (3.5) and (3.6) present the estimator errors and
their error-variances for various structures of filters with statistics
given as

$$ x(0) = -3.0, \quad \hat{x}(0) = -1.0, \quad V_{\tilde{x}}(0) = 0.1, \quad \Psi_u = \Psi_v = 1.0 $$

It is interesting to observe that the results obtained from
the proposed minimum variance filter come very close to those of the
quasi-moment minimum variance filter, whereas those of the stochastic
linearization filter come very close to those of the modified filter and
the extended Kalman filter. Moreover, the error-variances of the last
three filters are considerably lower than those of the first two filters,
while the estimator errors of the last three filters are considerably
greater than those of the first two filters. The fact that the last
three filters underestimate their error-variances is quite undesirable;
in some cases, it may produce detrimental effects, since the filters
indicate that they are performing very well when in reality they are not.

Using this particular set of statistics, it can be shown that
the random forcing term has a very significant effect, whereas the
difference between $E[\tilde{x}(t)\, h(x)]$ and $V_{\tilde{x}}\, h'(\hat{x})$
is very insignificant. The conclusions drawn from these theoretical
considerations agree well with the preceding experimental observations.

Figure 3.6

In Figures (3.7) and (3.8), the prior statistics are chosen
as

$$ x(0) = -0.5, \quad \hat{x}(0) = 0.5, \quad V_{\tilde{x}}(0) = 5.0, \quad \Psi_u = \Psi_v = 1.0 $$

The estimator errors obtained from the various structures of
finite dimensional filters are moderately different from one another.
In particular, the first-order extended Kalman filter is almost as
good as the second-order modified minimum variance filter, and is
moderately better than the second-order quasi-moment minimum variance
filter.

However, the error-variances appear to cluster into two
significantly different groups: the results from the proposed
minimum variance filter, the stochastic linearization filter and the
quasi-moment minimum variance filter cluster together, and are
significantly smaller than those of the modified minimum variance filter
and the extended Kalman filter.

It is obvious that some of the various second-order filters
and the stochastic linearization filter may have very distinct
performance characteristics. In some cases, their performance
characteristics may be almost as good as those of the proposed minimum
variance filter; but in other cases, they may produce very poor
estimates, even worse than those of the first-order extended Kalman
filter. Worst of all, they may underestimate their error-variances. This
makes the second-order filters and the stochastic linearization filter
very undesirable and unreliable.

Figures (3.9) - (3.11) are concerned with another system
given by

$$ \dot{x} = -x + 5 + u $$

$$ y = x^3 + v $$

and theoretical discussion and experimental observation similar to
those of Figures (3.1) to (3.8) can be carried out. They indicate that
the performance characteristics of the stochastic linearization filter
and the proposed minimum variance filter are much superior to those of
the first and the second order approximate filters.

Figures (3.12) - (3.23) are concerned with the Van der Pol
oscillator, with the system described by

$$ \dot{x}_1 = x_2 $$

$$ \dot{x}_2 = -x_1 + 3 x_2 (1 - x_1^2) $$

and

$$ y = x_1 - 0.1\, x_1^3 + v. $$

Figures (3.12) and (3.13) are, respectively, the measurements
$y$ for $\Psi_v = 5.0$ and $0.5$.

From Figures (3.14) - (3.21), it is obvious that, for both cases of
prior statistics, the performance characteristics of the proposed minimum
variance filter are much superior to those of the stochastic linearization
filter. Also, the comparison of Figures (3.14) to (3.17) with,
respectively, Figures (3.18) to (3.21) shows that the stochastic
linearization filter underestimates its error-variances, whereas the
error-variances obtained from the proposed filter clearly indicate the
performance of its filtering estimates.

Figure 3.9  The output run of the estimator error for systems
with $\hat{x}(0) = 0.0$, $x(0) = 1.5$, $\Psi_u = \Psi_v = 0.1$ and $V_{\tilde{x}}(0) = 2.25$

Figure 3.10  The output run of the error-variance for systems
with $\hat{x}(0) = 0.0$, $x(0) = 1.5$, $\Psi_u = \Psi_v = 0.1$ and $V_{\tilde{x}}(0) = 2.25$
(system: $\dot{x} = -x + 5 + u$, $y = x^3 + v$; curve labels: Q quasi-moment
minimum variance filter, T truncated minimum variance filter, M modified
minimum variance filter, V proposed minimum variance filter, L stochastic
linearization; horizontal axis: time in sec)

Figure 3.11  The output run of the estimator error for systems
with $\hat{x}(0) = 0.0$, $x(0) = -3.0$, $\Psi_u = \Psi_v = 1.0$ and $V_{\tilde{x}}(0) = 2.0$

Figure 3.12  The output run of the measurement $y$ for systems with $\Psi_v = 5.0$

Figure 3.14  The output run of $x_1$ and the estimates for $\Psi_v = 5.0$,
$V_{11} = 5.0$, $V_{12} = 0.0$ and $V_{22} = 20$

Figure 3.15  The output run of $x_2$ and the estimates for $\Psi_v = 5.0$,
$V_{11} = 5.0$, $V_{12} = 0.0$ and $V_{22} = 20$

Figure 3.17  The output run of [illegible] and the estimates for $\Psi_v = 0.5$,
$V_{11} = 5.0$, $V_{12} = 0.0$ and $V_{22} = 20$

Figure 3.18  The output run of the error-variance [illegible] for
systems with $\Psi_v = 5.0$, $V_{11} = 5.0$, $V_{12} = 0.0$ and [illegible]

Figure 3.19  The output run of the error-variance for [illegible]

Figure 3.20  The output run of the error-variance $V_{11}$ for
systems with $\Psi_v = 0.5$, $V_{11} = 5.0$, $V_{12} = 0.0$ and $V_{22} = 20$

Figure 3.21  The output run of the error-variance [illegible] for
systems with $\Psi_v = 0.5$, $V_{11} = 5.0$, $V_{12} = 0.0$ and [illegible]

Figure 3.22  The output run of $x_1$ and the estimates for systems
with $\Psi_v = 10.0$, $V_{11} = V_{22} = 1.0$ and $V_{12} = 0.0$

Figure 3.23  The output run of $x_2$ and the estimates for systems
with $\Psi_v = 10.0$, $V_{11} = V_{22} = 1.0$ and $V_{12} = 0.0$

The above observations can also be explained by the fact that,
in these cases, the random forcing terms are very significant. Therefore,
the deletion of the random forcing terms may result in misleading
estimators.

In Figures (3.22) and (3.23), another set of statistics
was chosen. In this case, the random forcing term is not significant
at all; therefore, it can be expected that the two different filters
yield much the same results.

On the basis of Figures (3.1) - (3.23), as well as many more
runs not presented in this thesis, it can be concluded that the
performance of the proposed minimum variance filter is much superior to
that of any other type of filter investigated. The results also point out
the danger of adopting any other finite dimensional filter, since the
random forcing terms, as well as the approximations introduced to its
nonlinearities, may be so significant that their deletion results in
unacceptable as well as misleading system estimates.

3.4  MINIMUM VARIANCE GENERAL CONTINUOUS NONLINEAR FILTER

Consider the general nonlinear message model

$$ \dot{x}(t) = f[x(t), t] + G[x(t), t]\, w(t) + E[x(t), t]\, u(t) \tag{3.21} $$

with measurement given by

$$ z(t) = h[x(t), t] + v(t) + y(t) \tag{3.22} $$

where $u(t)$ and $y(t)$ are known input time functions, $w(t)$ and $v(t)$
are correlated noise processes with means $\mu_w(t)$ and $\mu_v(t)$,
respectively, and also

$$ \mathrm{Cov}\{w(t), v(\tau)\} = \Psi_{wv}(t)\, \delta(t - \tau) \tag{3.23} $$

and all other prior statistics follow those of Section 3.2.

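Following the development of Section 3.2, and neglecting the
change of $p[\tilde{x}(t), t \mid Z(t)]$ due to the differential
measurements $\delta z(t)$, the filtering algorithm is given by

$$ \begin{aligned}
\dot{\hat{x}}(t) ={}& \hat{f}[\tilde{x}(t) + \hat{x}(t), t] + \hat{G}[\tilde{x}(t) + \hat{x}(t), t]\, \mu_w(t) \\
& + \hat{E}[\tilde{x}(t) + \hat{x}(t), t]\, u(t) + K(t)\, \{ z(t) - \mu_v(t) - y(t) \\
& - \hat{h}[\tilde{x}(t) + \hat{x}(t), t] \}
\end{aligned} \tag{3.24} $$

where a hat over a function denotes its conditional expectation, as in
Equation (3.9), and the gain algorithm is

$$ K(t) = E\{ \tilde{x}(t)\, h^T[\tilde{x}(t) + \hat{x}(t), t] + G[\tilde{x}(t) + \hat{x}(t), t]\, \Psi_{wv}(t) \}\, \Psi_v^{-1}(t) \tag{3.25} $$

and the error-variance equation is given by

$$ \begin{aligned}
\dot{V}_{\tilde{x}}(t) ={}& E\{ \tilde{x}(t)\, f^T[\tilde{x}(t) + \hat{x}(t), t] + f[\tilde{x}(t) + \hat{x}(t), t]\, \tilde{x}^T(t) \} \\
& + E\{ G[\tilde{x}(t) + \hat{x}(t), t]\, \Psi_w(t)\, G^T[\tilde{x}(t) + \hat{x}(t), t] \} \\
& - K(t)\, \Psi_v(t)\, K^T(t)
\end{aligned} \tag{3.26} $$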

To the author's knowledge, the results presented in this
section are new. In the particular case that $f$ and $h$ are linear, and
$G$ and $E$ are independent of the state variable, namely

$$ \begin{aligned}
f[x(t), t] &= F(t)\, x(t) \\
h[x(t), t] &= H(t)\, x(t) \\
G[x(t), t] &= G(t) \\
E[x(t), t] &= E(t)
\end{aligned} $$

Equations (3.24) - (3.26) are, respectively,

$$ \dot{\hat{x}}(t) = F(t)\, \hat{x}(t) + G(t)\, \mu_w(t) + E(t)\, u(t) + K(t)\, \{ z(t) - \mu_v(t) - y(t) - H(t)\, \hat{x}(t) \} $$

$$ K(t) = [ V_{\tilde{x}}(t)\, H^T(t) + G(t)\, \Psi_{wv}(t) ]\, \Psi_v^{-1}(t) $$

and

$$ \dot{V}_{\tilde{x}}(t) = F(t)\, V_{\tilde{x}}(t) + V_{\tilde{x}}(t)\, F^T(t) + G(t)\, \Psi_w(t)\, G^T(t) - K(t)\, \Psi_v(t)\, K^T(t) $$

These algorithms agree well with the general continuous
Kalman filter [49].
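The linear special case is compact enough to evaluate directly. The following is a sketch only, with hypothetical function names and scalar test values chosen for illustration; it computes the gain and the right-hand side of the error-variance equation with correlated process and measurement noise.

```python
import numpy as np

def kalman_gain(V, H, G, psi_wv, psi_v):
    """K = (V H^T + G psi_wv) psi_v^{-1} for the linear special case."""
    return (V @ H.T + G @ psi_wv) @ np.linalg.inv(psi_v)

def variance_rhs(V, F, H, G, psi_w, psi_wv, psi_v):
    """dV/dt = F V + V F^T + G psi_w G^T - K psi_v K^T."""
    K = kalman_gain(V, H, G, psi_wv, psi_v)
    return F @ V + V @ F.T + G @ psi_w @ G.T - K @ psi_v @ K.T

# Hypothetical scalar system written with 1x1 matrices.
F = np.array([[-1.0]])
H = np.array([[1.0]])
G = np.array([[1.0]])
psi_w = np.array([[1.0]])
psi_v = np.array([[1.0]])
no_corr = np.array([[0.0]])
half_corr = np.array([[0.5]])
V = np.array([[1.0]])

rate_uncorrelated = variance_rhs(V, F, H, G, psi_w, no_corr, psi_v)
rate_correlated = variance_rhs(V, F, H, G, psi_w, half_corr, psi_v)

# Steady state of the uncorrelated scalar case: -2V + 1 - V^2 = 0.
V_steady = np.array([[np.sqrt(2.0) - 1.0]])
rate_steady = variance_rhs(V_steady, F, H, G, psi_w, no_corr, psi_v)
```

With zero cross-covariance the expressions reduce to the familiar matrix Riccati form, and the steady-state check gives a quick sanity test of the signs.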

3.5  MINIMUM  VARIANCE  CONTINUOUS  NONLINEAR  NOISE-FREE  FILTERING 

Consider the continuous nonlinear message model of Equation (3.1)
with the noise-free measurement model given by

$$ z(t) = h[x(t), t] \tag{3.27} $$

where $w(t)$ is zero-mean white noise with non-negative definite variance
$\Psi_w(t)$, and $h[x(t), t]$ is considered to be of full rank; otherwise an
equivalent $z(t)$ of lower dimension can be used.

Since $z(t)$ is noise free and $\Psi_w(t)$ is non-negative definite,
when $z(t)$ is differentiated, some of its elements may not contain any
white noise. Therefore, each element of $z(t)$ has to be differentiated,
with Equation (3.1) used to substitute for the derivative of the state
variable, until white noise is obtained in the derivatives of each
element of $z(t)$.

The signals obtained can be arranged into two sets. In the
first set

$$ z_1(t) = h_1[x(t), t] $$

which comprises Equation (3.27) and all linearly independent derivatives
of $z(t)$ that are noise free.

In the second set

$$ z_2(t) = h_2[x(t), t] + N[x(t), t]\, w(t) $$

which comprises all derivatives of $z(t)$ that contain linearly independent
white noise.
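As an illustration of this partition, consider a hypothetical second-order system that is not taken from the thesis: a scalar position measurement of an oscillator driven by white noise. Differentiating the measurement twice is needed before white noise appears.

```latex
% Hypothetical example: f = [x_2, -x_1]^T, G = [0, 1]^T in Equation (3.1)
\begin{aligned}
\dot{x}_1 &= x_2, \qquad \dot{x}_2 = -x_1 + w, \qquad z = x_1 \\
\dot{z}  &= \dot{x}_1 = x_2 && \text{(noise free)} \\
\ddot{z} &= \dot{x}_2 = -x_1 + w && \text{(contains white noise)} \\[4pt]
z_1 &= \begin{bmatrix} z \\ \dot{z} \end{bmatrix}
     = \begin{bmatrix} x_1 \\ x_2 \end{bmatrix} = h_1[x, t], \\
z_2 &= \ddot{z} = h_2[x, t] + N[x, t]\, w,
\qquad h_2 = -x_1, \quad N = 1
\end{aligned}
```

In this sketch the first set collects the measurement and its noise-free first derivative, while the second set consists of the second derivative, in which the white noise enters with $N = 1$.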

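It is assumed that the filtering estimate $\hat{x}(t)$ is given by

$$ \dot{\hat{x}}(t) = \ell[\hat{x}(t), t] + K_1(t)\, z_1(t) + K_2(t)\, z_2(t) $$

since, in place of Equation (3.27), there are two sets of measurements.

Here, $z_1(t)$ is considered as a known input; it does not contain
any new information. Following the development of Section 3.2, the
following filtering algorithm is obtained

$$ \dot{\hat{x}}(t) = \hat{f}[\tilde{x}(t) + \hat{x}(t), t] + K_2(t)\, \{ z_2(t) - \hat{h}_2[\tilde{x}(t) + \hat{x}(t), t] \} $$

with the gain algorithm

$$ \begin{aligned}
K_2(t) ={}& E\{ \tilde{x}(t)\, h_2^T[\tilde{x}(t) + \hat{x}(t), t] + G[\tilde{x}(t) + \hat{x}(t), t]\, \Psi_w(t)\, N^T[\tilde{x}(t) + \hat{x}(t), t] \} \\
& \cdot \big[ E\{ N[\tilde{x}(t) + \hat{x}(t), t]\, \Psi_w(t)\, N^T[\tilde{x}(t) + \hat{x}(t), t] \} \big]^{-1}
\end{aligned} $$

and the error-variance equation is given as

$$ \begin{aligned}
\dot{V}_{\tilde{x}}(t) ={}& E\{ \tilde{x}(t)\, f^T[\tilde{x}(t) + \hat{x}(t), t] + f[\tilde{x}(t) + \hat{x}(t), t]\, \tilde{x}^T(t) \} \\
& + E\{ G[\tilde{x}(t) + \hat{x}(t), t]\, \Psi_w(t)\, G^T[\tilde{x}(t) + \hat{x}(t), t] \} \\
& - K_2(t)\, E\{ N[\tilde{x}(t) + \hat{x}(t), t]\, \Psi_w(t)\, N^T[\tilde{x}(t) + \hat{x}(t), t] \}\, K_2^T(t)
\end{aligned} $$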

To the author's knowledge, the results presented here are also
new. In the case that the vector valued functions are linear, the results
presented here agree well with those of Bryson and Johansen [6].

3.6  CONCLUSION

In this chapter, nonlinear filtering algorithms for correlated,
uncorrelated or noise-free processes were derived. To the author's
knowledge, the filtering algorithms presented for non-white noise
processes are new. In order that the systems be physically realizable,
the assumption that the probability density functions of the estimator
errors are Gaussian is needed to evaluate the expectations.

In Section 3.3, various nonlinear systems with white Gaussian
noise processes were simulated, and the results were compared with those
obtained from various other finite dimensional approximate nonlinear
filters. The results clearly indicate the superiority of the proposed
minimum variance filter over the other filters investigated, and
theoretical explanations are also given for the apparently poor
performance characteristics of the various other filters considered.

In the case that the systems are linear, the proposed minimum
variance filters agree well with the optimal filters in the
literature [6, 49].

CHAPTER IV


MINIMUM VARIANCE FILTERING AND SMOOTHING
FOR DISCRETE NONLINEAR DELAYED SYSTEMS WITH ADDITIVE WHITE NOISE

4.1  INTRODUCTION

This  chapter  deals  with  discrete-time  filtering  and  smoothing 
estimation  of  nonlinear  systems  with  multiple  delays  imbedded  in  additive 
white  noise  processes.  In  general,  the  filtering  algorithm  enables 
one  to  estimate  present  values  of  the  variables  of  interest  using 
present  data,  whereas  the  smoother  allows  one  to  estimate  past  values. 

A  typical  smoothing  problem  is  the  post-flight  estimation  of  the  flight 
path  of  a  missile  based  on  tracking  system  measurements  during  the 
entire  duration  of  the  flight.  If  the  estimates  of  the  missile's 
position  and  velocity  at  one  particular  flight  point  are  desired,  the 
estimates  can  be  based  upon  all  the  measurements  recorded,  including 
those  made  before  and  after  that  particular  flight  point. 

For linear systems without delays involving additive white noise
processes, numerous papers have been written on filtering and smoothing
estimation. Most of them, however, are merely rederivations of the
earlier recursive algorithms presented by Carlton [50], Rauch [51] and
Bryson and Frazier [11], or reformulations of the problem using various
estimation techniques.

However, Kelly and Anderson [52] pointed out that the
algorithms for both discrete and continuous-time, linear, fixed-lag
smoothing  given  in  [11,  15,  43,  51]  may  be  unstable,  and  therefore 
impractical.  To  be  more  explicit,  although  the  fixed-lag  smoothing 
equations  are  bounded-input  and  bounded-output  stable,  realizations  of 
these  in  [43,  51]  contain  a  subsystem  which  is  unstable  in  the  sense 
of  Lyapunov.  In  [53],  it  is  pointed  out  that  the  apparent  culprit  is 
an  uncontrollable  and  unstable  block  in  the  smoother  state  equations 
which  can  be  removed  without  affecting  the  input-output  characteristics. 

In [38], a computationally stable smoothing algorithm is
derived for linear discrete systems containing time delays, using the
method of orthogonal projection. The smoother for linear discrete
systems without delays can be considered as a special case of the above
problem, with the time-delay index set to zero. The results of [38]
are rederived in a simple manner in [39], using the state augmentation
technique. However, the smoothers derived in such a manner are of
dimension nN, where n is the order of the message model and N is the
length of the fixed lag.

In this chapter, the matrix minimum principle is applied to
nonlinear discrete-time systems with multiple delays, resulting in new
dynamic discrete estimation algorithms, which are reported in [53, 54].
In Section 4.2, the message model involves multiple time delays but
the measurement is without delay. In Section 4.3, more general
nonlinear system models are considered, where both the message and
measurement models contain multiple time delays.

For nonlinear estimation problems, the results obtained in
this  chapter  are  new,  recursive  in  nature  and  directly  yield  the 
fixed-interval,  fixed-lag,  fixed  point  smoothing  and  the  filtering 
algorithms.  The  derivation  is  straightforward  and  shows  the  close 
link  between  the  smoothing  and  filtering  estimation  algorithms. 

The estimation algorithms presented in this chapter can be
easily applied to various special cases of nonlinear estimation problems,
for example, the estimation problem for nonlinear systems without
delays, the estimation of nonlinear systems with multiple delays
appearing only in the measurements, and the linear estimation of systems
with or without delays.

In  the  particular  case  of  linear  message  and  measurement 
models  with  non-delayed  measurements,  the  proposed  filtering  and  fixed-lag 
smoothing  algorithms  agree  well  with  stable  estimation  algorithms 
presented  by  Priemer  and  Vacroux  [37-38],  and  later  extended  by  Biswas 
and  Mahalanabis  [55],  and  Shukla  and  Srinath  [56],  to  fixed  point 
smoothing  of  the  same  systems  and  fixed-lag  smoothing  for  systems  with 
multiple  delays  in  the  measurements,  respectively. 

4.2  NONLINEAR  DISCRETE  DELAYED  SYSTEMS  WITH  NON-DELAYED  MEASUREMENTS 

This section is concerned with nonlinear discrete-time dynamic
systems containing multiple time delays, with a non-delayed measurement
sequence, imbedded in additive white noise processes.

In Section 4.2.1, the problem statement is presented. Section
4.2.2 presents the derivation of the nonlinear smoother. Sections
4.2.3 to 4.2.6 provide handy sets of reference equations for the
fixed-lag, fixed-point and fixed-interval smoothing algorithms and the
filtering algorithm, respectively. Section 4.2.7 shows the applicability
of the presented algorithms to linear estimation problems and to various
other special cases of estimation problems, where they are identified
with existing algorithms in the literature.

4.2.1  The  Problem  Statement 

The  processes  considered  in  this  section  are  described  by 
the  following  vector  difference  equation 

$$x(k+1) = \sum_{j=0}^{L} f_j[x(k-j),\,k-j] + G[x(k),k]\,w(k) \tag{4.2.1}$$

where  the  measurement  is  given  by 

y(k)  =  h[x(k),k]  +  v(k)  (4.2.2) 

Here, the state x is an n-vector; the measurement y, an m-vector; the
random input w, an r-vector; the measurement noise v, an m-vector;
G, a nonlinear state-dependent n x r matrix; and k = 0, 1, ... is the
discrete time index. The nonlinear vector-valued functions $f_j$ and h
are, respectively, n- and m-dimensional.

The random vectors w(k) and v(k) are independent zero-mean
white Gaussian sequences, for which

$$E\{w(k)\,w^T(j)\} = \Psi_w(k)\,\delta_{kj}$$

$$E\{v(k)\,v^T(j)\} = \Psi_v(k)\,\delta_{kj}$$

and

$$E\{w(k)\,v^T(j)\} = 0$$

for all integers k and j, where $E\{\cdot\}$ denotes the expectation operator,
T the matrix transpose, $\delta_{kj}$ the Kronecker delta, and $\Psi_v$ and $\Psi_w$ are
m x m and r x r positive definite matrices, respectively.

The initial states x(0) and x(-j), for j = 1, ..., L, are zero-
mean Gaussian random vectors, independent of v(k) and w(k), for which

$$E\{x(-j)\,w^T(k)\} = 0$$

and

$$E\{x(-j)\,v^T(k)\} = 0$$

for j = 0, 1, ..., L, and k = 0, 1, ..., with a positive definite covariance
matrix

$$E\{x(-j)\,x^T(-\ell)\} = V_x(j,\ell)$$

for $j, \ell = 0, \ldots, L$.

The smoothing problem is to obtain $\hat{x}(k-\ell+1/k+1)$, the unbiased
smoothing estimate of $x(k-\ell+1)$, with $0 \le \ell \le k+1$, conditioned on
the set of measurements

$$Y(k+1) = \{y(0),\, y(1),\, \ldots,\, y(k+1)\}$$

such that the cost function

$$J = \sum_{j=0}^{k+1} \mathrm{Trace}\,[M(j)\,V_{\tilde{x}}(j-\ell+1/j+1)] \tag{4.2.3}$$

is minimized. Here M(j) is some non-negative definite weighting matrix,
and $V_{\tilde{x}}(j-\ell+1/j+1)$ is defined by

$$V_{\tilde{x}}(j-\ell+1/j+1) = E_{j+1}[\{x(j-\ell+1) - \hat{x}(j-\ell+1/j+1)\}\{x(j-\ell+1) - \hat{x}(j-\ell+1/j+1)\}^T] \tag{4.2.4}$$

where $E_{j+1}\{\cdot\}$ denotes the expectation operation conditioned on the
set of measurements Y(j+1).

Since M(j) is known, minimizing Equation (4.2.3) is equivalent
to minimizing the trace of $V_{\tilde{x}}(j-\ell+1/j+1)$ at each individual sample point
j [49].

4.2.2  The  Derivation  of  the  Smoother 

It is assumed that the smoothing algorithm is described by the
nonlinear difference equation

$$\hat{x}(k-\ell+1/k+1) = \sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),\,k-\ell-j] + K_{k+1}^{\ell}\,y(k+1) \tag{4.2.5}$$

Here, the assumption of linearity in innovations is made. The nonlinear
functions $\sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),\,k-\ell-j]$ and the gain $K_{k+1}^{\ell}$ are yet to be determined.

In fact, the smoothing equation may take various forms;
however, it is essential that enough information concerning $\hat{x}(k-\ell-j/k)$
and y(k+1) is included in the estimator model. Since the message
model considered in this section consists of the sum of nonlinear
functions containing multiple time delays, it is realistic to assume that
the smoothing estimate is a linear combination of the sum of nonlinear
functions of $\hat{x}(k-\ell-j/k)$, to account for the time delay characteristics,
and the present measurement y(k+1). Here $\hat{x}(k-\ell-j/k)$ is assumed to have
made optimum use of all the measurements up to y(k).


The  problem  formulated  in  such  a  manner  may  therefore  lead 
to  a  smoother,  optimum  with  respect  to  the  imposed  constraints,  but 
not  identical  to  the  truly  optimal  one. 

It is also interesting to note that the smoothing algorithm may be
described by other dynamic equations such as

$$\hat{x}(k-\ell+1/k+1) = \sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),\,k-\ell-j] + K_{k+1}^{\ell}\,\tilde{y}(k+1/k) \tag{4.2.6}$$

where $\tilde{y}(k+1/k) = y(k+1) - E_k\{y(k+1)\}$. This quantity represents the "new"
information contained in y(k+1), since the best estimate of y(k+1) given
Y(k), namely $E_k\{y(k+1)\}$, is subtracted from y(k+1) to obtain $\tilde{y}(k+1/k)$,
which is generally known as the innovation. Using either Equation (4.2.5)
or (4.2.6) would yield exactly the same dynamic smoother. This can be
explained by the fact that the two equations contain exactly the same
amount of information, that is, y(k+1) contains as much information as
its innovation $\tilde{y}(k+1/k)$ [14].


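Now the estimation problem is to determine the time-varying
nonlinear vector function $\sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),\,k-\ell-j]$ and the gain
algorithm $K_{k+1}^{\ell}$ such that the trace of $V_{\tilde{x}}(k-\ell+1/k+1)$ is minimized.

Let $\tilde{x}(k-\ell+1/k+1)$ denote the smoothing error defined by

$$\tilde{x}(k-\ell+1/k+1) = x(k-\ell+1) - \hat{x}(k-\ell+1/k+1) \tag{4.2.7}$$

then

$$\begin{aligned}
\tilde{x}(k-\ell+1/k+1) ={}& \sum_{j=0}^{L} f_j[x(k-\ell-j),\,k-\ell-j] + G[x(k-\ell),\,k-\ell]\,w(k-\ell) \\
&- \sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),\,k-\ell-j] - K_{k+1}^{\ell}\{h[x(k+1),\,k+1] + v(k+1)\}
\end{aligned} \tag{4.2.8}$$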


In order that $\hat{x}(k-\ell+1/k+1)$ is an unbiased smoothing estimate,
it is necessary that

$$\sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),\,k-\ell-j] = \sum_{j=0}^{L} \hat{f}_j[x(k-\ell-j),\,k-\ell-j/k] - K_{k+1}^{\ell}\,\hat{h}[x(k+1),\,k+1/k] \tag{4.2.9}$$

where

$$\hat{f}_j[x(k-\ell-j),\,k-\ell-j/k] = E_k\{f_j[x(k-\ell-j),\,k-\ell-j]\}$$

and

$$\hat{h}[x(k+1),\,k+1/k] = E_k\{h[x(k+1),\,k+1]\}$$

Substituting Equation (4.2.9) into (4.2.5), the smoothing
algorithm becomes

$$\hat{x}(k-\ell+1/k+1) = \hat{x}(k-\ell+1/k) + K_{k+1}^{\ell}\{y(k+1) - \hat{h}[x(k+1),\,k+1/k]\} \tag{4.2.10}$$

for k = 0, 1, ..., and $0 \le \ell \le k+1$, where

$$\hat{x}(k-\ell+1/k) = \sum_{j=0}^{L} \hat{f}_j[x(k-\ell-j),\,k-\ell-j/k] \tag{4.2.11}$$

Equation (4.2.8) can be rewritten as

$$\tilde{x}(k-\ell+1/k+1) = \tilde{x}(k-\ell+1/k) - K_{k+1}^{\ell}\{\tilde{h}[x(k+1)/k] + v(k+1)\} \tag{4.2.12}$$

where

$$\tilde{h}[x(k+1)/k] = h[x(k+1),\,k+1] - \hat{h}[x(k+1),\,k+1/k] \tag{4.2.13}$$

Therefore, the smoothing error-variance equation is given by

$$\begin{aligned}
V_{\tilde{x}}(k-\ell+1/k+1) ={}& V_{\tilde{x}}(k-\ell+1/k) - E_k\{\tilde{x}(k-\ell+1/k)\,\tilde{h}^T[x(k+1)/k]\}\,K_{k+1}^{\ell T} \\
&- K_{k+1}^{\ell}\,E_k\{\tilde{h}[x(k+1)/k]\,\tilde{x}^T(k-\ell+1/k)\} + K_{k+1}^{\ell}\,\Psi_v(k+1)\,K_{k+1}^{\ell T} \\
&+ K_{k+1}^{\ell}\,E_k\{\tilde{h}[x(k+1)/k]\,\tilde{h}^T[x(k+1)/k]\}\,K_{k+1}^{\ell T}
\end{aligned} \tag{4.2.14}$$

Here, $K_{k+1}^{\ell}$ is the only variable available for manipulation.
The necessary condition for minimizing the trace of $V_{\tilde{x}}(k-\ell+1/k+1)$,
subject to the constraint of Equation (4.2.14), is provided by the matrix
minimum principle, for which $K_{k+1}^{\ell}$ is considered as the control variable.
The necessary condition can now be obtained from the condition

$$\frac{\partial}{\partial K_{k+1}^{\ell}}\,\mathrm{Trace}\,[V_{\tilde{x}}(k-\ell+1/k+1)] = [0] \tag{4.2.15}$$

where [0] is the null matrix, and the result is

$$K_{k+1}^{\ell} = E_k\{\tilde{x}(k-\ell+1/k)\,\tilde{h}^T[x(k+1)/k]\}\,[\Psi_v(k+1) + E_k\{\tilde{h}[x(k+1)/k]\,\tilde{h}^T[x(k+1)/k]\}]^{-1} \tag{4.2.16}$$

and Equation (4.2.14) becomes

$$V_{\tilde{x}}(k-\ell+1/k+1) = V_{\tilde{x}}(k-\ell+1/k) - K_{k+1}^{\ell}\,E_k\{\tilde{h}[x(k+1)/k]\,\tilde{x}^T(k-\ell+1/k)\} \tag{4.2.17}$$
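The step from Equation (4.2.14) to Equations (4.2.16) and (4.2.17) can be sketched as follows. The shorthand symbols C, S and K are introduced here only for illustration and do not appear in the original derivation; the sketch uses the standard trace-derivative identities and the symmetry of S.

```latex
% Shorthand (assumed here, not from the thesis):
%   K = K_{k+1}^{\ell}
%   C = E_k\{\tilde{x}(k-\ell+1/k)\,\tilde{h}^T[x(k+1)/k]\}
%   S = \Psi_v(k+1) + E_k\{\tilde{h}[x(k+1)/k]\,\tilde{h}^T[x(k+1)/k]\}, \quad S = S^T
\begin{aligned}
V_{\tilde{x}}(k-\ell+1/k+1) &= V_{\tilde{x}}(k-\ell+1/k) - C K^T - K C^T + K S K^T
  && \text{(4.2.14) in shorthand} \\
\frac{\partial}{\partial K}\,\mathrm{Trace}\,[V_{\tilde{x}}(k-\ell+1/k+1)] &= -2C + 2KS = [0]
  && \text{condition (4.2.15)} \\
K &= C S^{-1}
  && \text{gain (4.2.16)} \\
K S K^T &= C S^{-1} S K^T = C K^T
  && \text{so two terms cancel} \\
V_{\tilde{x}}(k-\ell+1/k+1) &= V_{\tilde{x}}(k-\ell+1/k) - K C^T
  && \text{result (4.2.17)}
\end{aligned}
```

The inverse exists because $\Psi_v(k+1)$ is positive definite and the second term of S is non-negative definite.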


Since

$$\tilde{x}(k-\ell+1/k) = \sum_{j=0}^{L} \tilde{f}_j[x(k-\ell-j)/k] + G[x(k-\ell),\,k-\ell]\,w(k-\ell)$$

where

$$\tilde{f}_j[x(k-\ell-j)/k] = f_j[x(k-\ell-j),\,k-\ell-j] - \hat{f}_j[x(k-\ell-j),\,k-\ell-j/k]$$

then

$$V_{\tilde{x}}(k-\ell+1/k) = \sum_{i=0}^{L}\sum_{j=0}^{L} E_k\{\tilde{f}_i[x(k-\ell-i)/k]\,\tilde{f}_j^T[x(k-\ell-j)/k]\} + G[x(k-\ell),\,k-\ell]\,\Psi_w(k-\ell)\,G^T[x(k-\ell),\,k-\ell] \tag{4.2.18}$$

and one can also obtain

$$V_{\tilde{x}}(k-\ell-i,\,k-\ell-j/k) = V_{\tilde{x}}(k-\ell-i,\,k-\ell-j/k-1) - K_{k}^{\ell+i}\,E_{k-1}\{\tilde{h}[x(k)/k-1]\,\tilde{x}^T(k-\ell-j/k-1)\} \tag{4.2.19}$$

for $0 \le \ell+i,\ \ell+j \le k$, where $V_{\tilde{x}}(k-\ell-i,\,k-\ell-j/k)$ is defined by the
relation

$$V_{\tilde{x}}(k-\ell-i,\,k-\ell-j/k) = E_k\{\tilde{x}(k-\ell-i/k)\,\tilde{x}^T(k-\ell-j/k)\}$$

For k < 0, there is no input to the smoother and therefore one
can set $\hat{x}(-j/-1)$ to zero for j = 0, 1, ..., L, which results in

$$\tilde{x}(-j/-1) = x(-j)$$

and

$$V_{\tilde{x}}(-j,\,-\ell/-1) = V_x(j,\ell)$$

for $j, \ell = 0, 1, \ldots, L$.

Since the smoothing estimator is unbiased, the expectations
in Equations (4.2.16) - (4.2.19) can be replaced by the following

$$E_k\{\tilde{x}(k-\ell+1/k)\,\tilde{h}^T[x(k+1)/k]\} = E_k\{\tilde{x}(k-\ell+1/k)\,h^T[x(k+1),\,k+1]\} \tag{4.2.20}$$

$$E_k\{\tilde{h}[x(k+1)/k]\,\tilde{h}^T[x(k+1)/k]\} = E_k\{h[x(k+1),\,k+1]\,h^T[x(k+1),\,k+1]\} - \hat{h}[x(k+1),\,k+1/k]\,\hat{h}^T[x(k+1),\,k+1/k] \tag{4.2.21}$$

and

$$E_k\{\tilde{f}_i[x(k-\ell-i)/k]\,\tilde{f}_j^T[x(k-\ell-j)/k]\} = E_k\{f_i[x(k-\ell-i),\,k-\ell-i]\,f_j^T[x(k-\ell-j),\,k-\ell-j]\} - \hat{f}_i[x(k-\ell-i),\,k-\ell-i/k]\,\hat{f}_j^T[x(k-\ell-j),\,k-\ell-j/k] \tag{4.2.22}$$
It  should  be  noted  that  in  the  case  of  nonlinear  systems 
the  above  expectations  require  infinite  dimensional  systems  for  their 
realization.  The  problem  of  obtaining  a  good  approximation  to  the 
above  expectations  is,  therefore,  of  practical  importance. 

For practical realization, it is assumed that the conditional
probability density functions of the smoothing error $\tilde{x}$ are Gaussian.
The expectations can then be obtained without any further approximation
for systems with product-type or polynomial nonlinearities.

To evaluate the above expectations, one can make use of
Equation (4.2.7) to replace the state variables by the sum of their
respective estimates and the estimation errors. For example, in the
scalar case of

$$h[x(k+1),\,k+1] = x^3(k+1)$$

from Equations (4.2.20) and (4.2.7), it can be shown that

$$\begin{aligned}
E_k\{\tilde{x}(k-\ell+1/k)\,\tilde{h}^T[x(k+1)/k]\} ={}& E_k\{\tilde{x}(k-\ell+1/k)\,[\hat{x}^3(k+1/k) + 3\hat{x}^2(k+1/k)\,\tilde{x}(k+1/k) \\
&\quad + 3\hat{x}(k+1/k)\,\tilde{x}^2(k+1/k) + \tilde{x}^3(k+1/k)]\} \\
={}& 3V_{\tilde{x}}(k-\ell+1,\,k+1/k)\,\hat{x}^2(k+1/k) + 3V_{\tilde{x}}(k-\ell+1,\,k+1/k)\,V_{\tilde{x}}(k+1/k)
\end{aligned}$$

Notice that in the evaluation of the expectation, the only
assumption needed is the Gaussian assumption on the estimator error.
It can easily be shown that if the nonlinearities were expanded in
first- or second-order Taylor series, the last term in the
preceding equation would have been dropped. In such a case the
resulting filter is equivalent to an intelligent application of an
extended Kalman filter.

Similarly, it is obtained from Equations (4.2.21) and (4.2.7) that

$$\begin{aligned}
E_k\{\tilde{h}[x(k+1)/k]\,\tilde{h}^T[x(k+1)/k]\} ={}& E_k\{x^6(k+1)\} - [E_k\{x^3(k+1)\}]^2 \\
={}& [15V_{\tilde{x}}^3(k+1/k) + 45V_{\tilde{x}}^2(k+1/k)\,\hat{x}^2(k+1/k) + 15V_{\tilde{x}}(k+1/k)\,\hat{x}^4(k+1/k) + \hat{x}^6(k+1/k)] \\
&- [\hat{x}^3(k+1/k) + 3V_{\tilde{x}}(k+1/k)\,\hat{x}(k+1/k)]^2 \\
={}& 15V_{\tilde{x}}^3(k+1/k) + 36V_{\tilde{x}}^2(k+1/k)\,\hat{x}^2(k+1/k) + 9V_{\tilde{x}}(k+1/k)\,\hat{x}^4(k+1/k)
\end{aligned}$$

The preceding evaluation is rather straightforward. In
the particular case that higher-order terms do not improve system
accuracy, one can delete the higher-order terms and approximate the
expectations up to arbitrary order, as desired.
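The closed forms obtained above for the cubic measurement can be checked with exact Gaussian moments. The following is a minimal sketch under the Gaussian error assumption stated in the text; the function names and the arguments `x_hat`, `V` and `C` (standing for the predicted estimate, its error variance and the cross-covariance) are hypothetical and chosen only for this illustration.

```python
from math import comb


def central_moment(n, var):
    """E[e**n] for a zero-mean Gaussian e with variance var."""
    if n % 2:
        return 0
    double_factorial = 1
    for odd in range(1, n, 2):  # (n-1)!! = 1 * 3 * 5 * ... * (n-1)
        double_factorial *= odd
    return double_factorial * var ** (n // 2)


def raw_moment(n, mean, var):
    """E[(mean + e)**n], obtained by binomial expansion."""
    return sum(comb(n, i) * mean ** (n - i) * central_moment(i, var)
               for i in range(n + 1))


def cubic_terms(x_hat, V, C):
    """Terms needed by the gain when h[x] = x**3 (scalar case).

    x_hat : predicted estimate
    V     : predicted error variance
    C     : cross-covariance between the smoothing error and the
            prediction error
    """
    h_hat = raw_moment(3, x_hat, V)            # E{x^3}
    cross = 3 * C * x_hat ** 2 + 3 * C * V     # closed form derived from (4.2.20)
    innov = raw_moment(6, x_hat, V) - h_hat ** 2  # E{x^6} - E{x^3}^2
    return h_hat, cross, innov


h_hat, cross, innov = cubic_terms(x_hat=2, V=3, C=1)
closed_form = 15 * 3 ** 3 + 36 * 3 ** 2 * 2 ** 2 + 9 * 3 * 2 ** 4
print(h_hat, cross, innov, closed_form)
```

With integer inputs the arithmetic is exact, so the moment-based value of the innovation variance can be compared directly with the closed-form polynomial.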

4.2.3  The  Nonlinear  Fixed-Lag  Smoothing 

Replacing k+1 and $\ell$ by k and N, respectively, where N < k,
Equations (4.2.10), (4.2.11), (4.2.16) and (4.2.19) yield the
following recursive nonlinear fixed-lag smoothing algorithms:

$$\hat{x}(k-N/k) = \hat{x}(k-N/k-1) + K_k^{N}\{y(k) - \hat{h}[x(k),\,k/k-1]\} \tag{4.2.23}$$

$$\hat{x}(k-N/k-1) = \sum_{j=0}^{L} \hat{f}_j[x(k-N-1-j),\,k-N-1-j/k-1] \tag{4.2.24}$$

$$K_k^{N} = E_{k-1}\{\tilde{x}(k-N/k-1)\,\tilde{h}^T[x(k)/k-1]\}\,[\Psi_v(k) + E_{k-1}\{\tilde{h}[x(k)/k-1]\,\tilde{h}^T[x(k)/k-1]\}]^{-1} \tag{4.2.25}$$

$$V_{\tilde{x}}(k-N/k) = V_{\tilde{x}}(k-N/k-1) - K_k^{N}\,E_{k-1}\{\tilde{h}[x(k)/k-1]\,\tilde{x}^T(k-N/k-1)\} \tag{4.2.26}$$

$$V_{\tilde{x}}(k-N/k-1) = \sum_{i=0}^{L}\sum_{j=0}^{L} E_{k-1}\{\tilde{f}_i[x(k-1-N-i)/k-1]\,\tilde{f}_j^T[x(k-1-N-j)/k-1]\} + G[x(k-1-N),\,k-1-N]\,\Psi_w(k-1-N)\,G^T[x(k-1-N),\,k-1-N] \tag{4.2.27}$$

$$V_{\tilde{x}}(k-1-N-i,\,k-1-N-j/k-1) = V_{\tilde{x}}(k-1-N-i,\,k-1-N-j/k-2) - K_{k-1}^{N+i}\,E_{k-2}\{\tilde{h}[x(k-1)/k-2]\,\tilde{x}^T(k-1-N-j/k-2)\} \tag{4.2.28}$$

Also

$$V_{\tilde{x}}(k-N,\,k+1/k) = E_k\{\tilde{x}(k-N/k) \sum_{j=0}^{L} \tilde{f}_j^T[x(k-j)/k]\}$$

and

$$V_{\tilde{x}}(k+1/k) = \sum_{i=0}^{L}\sum_{j=0}^{L} E_k\{\tilde{f}_i[x(k-i)/k]\,\tilde{f}_j^T[x(k-j)/k]\} + G[x(k),\,k]\,\Psi_w(k)\,G^T[x(k),\,k] \tag{4.2.29}$$

4.2.4  The  Nonlinear  Fixed-Point  Smoothing 

Setting $\ell = k-N+1$, where k+1 > N, Equations (4.2.10),
(4.2.11), (4.2.16) - (4.2.18) yield the following
recursive nonlinear fixed-point smoothing algorithms.

$$\hat{x}(N/k+1) = \hat{x}(N/k) + K_{k+1}^{k-N+1}\{y(k+1) - \hat{h}[x(k+1),\,k+1/k]\}$$

$$\hat{x}(N/k) = \sum_{j=0}^{L} \hat{f}_j[x(N-1-j),\,N-1-j/k]$$

$$K_{k+1}^{k-N+1} = E_k\{\tilde{x}(N/k)\,\tilde{h}^T[x(k+1)/k]\}\,[\Psi_v(k+1) + E_k\{\tilde{h}[x(k+1)/k]\,\tilde{h}^T[x(k+1)/k]\}]^{-1}$$

$$V_{\tilde{x}}(N/k+1) = V_{\tilde{x}}(N/k) - K_{k+1}^{k-N+1}\,E_k\{\tilde{h}[x(k+1)/k]\,\tilde{x}^T(N/k)\}$$

$$V_{\tilde{x}}(N/k) = \sum_{i=0}^{L}\sum_{j=0}^{L} E_k\{\tilde{f}_i[x(N-1-i)/k]\,\tilde{f}_j^T[x(N-1-j)/k]\} + G[x(N-1),\,N-1]\,\Psi_w(N-1)\,G^T[x(N-1),\,N-1]$$

$$V_{\tilde{x}}(N-1-i,\,N-1-j/k) = V_{\tilde{x}}(N-1-i,\,N-1-j/k-1) - K_{k}^{k-N+1+i}\,E_{k-1}\{\tilde{h}[x(k)/k-1]\,\tilde{x}^T(N-1-j/k-1)\}$$

and

$$V_{\tilde{x}}(N,\,k+1/k) = E_k\{\tilde{x}(N/k) \sum_{j=0}^{L} \tilde{f}_j^T[x(k-j)/k]\}$$

$V_{\tilde{x}}(k+1/k)$ is the same as in Equation (4.2.29), and also from
Equation (4.2.19), it can be shown that

$$V_{\tilde{x}}(N,\,k-j/k) = V_{\tilde{x}}(N,\,k-j/k-1) - K_{k}^{k-N}\,E_{k-1}\{\tilde{h}[x(k)/k-1]\,\tilde{x}^T(k-j/k-1)\}$$

4.2.5  The Nonlinear Fixed-Interval Smoothing

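Setting k+1 = N and $\ell = N-k$, where N > k, Equations (4.2.10), (4.2.11),
(4.2.16) - (4.2.19) yield the following recursive nonlinear fixed-
interval smoothing algorithms.

$$\hat{x}(k/N) = \hat{x}(k/N-1) + K_{N}^{N-k}\{y(N) - \hat{h}[x(N),\,N/N-1]\}$$

$$\hat{x}(k/N-1) = \sum_{j=0}^{L} \hat{f}_j[x(k-1-j),\,k-1-j/N-1]$$

$$K_{N}^{N-k} = E_{N-1}\{\tilde{x}(k/N-1)\,\tilde{h}^T[x(N)/N-1]\}\,[\Psi_v(N) + E_{N-1}\{\tilde{h}[x(N)/N-1]\,\tilde{h}^T[x(N)/N-1]\}]^{-1}$$

$$V_{\tilde{x}}(k/N) = V_{\tilde{x}}(k/N-1) - K_{N}^{N-k}\,E_{N-1}\{\tilde{h}[x(N)/N-1]\,\tilde{x}^T(k/N-1)\}$$

$$V_{\tilde{x}}(k/N-1) = \sum_{i=0}^{L}\sum_{j=0}^{L} E_{N-1}\{\tilde{f}_i[x(k-1-i)/N-1]\,\tilde{f}_j^T[x(k-1-j)/N-1]\} + G[x(k-1),\,k-1]\,\Psi_w(k-1)\,G^T[x(k-1),\,k-1]$$

$$V_{\tilde{x}}(k-1-i,\,k-1-j/N-1) = V_{\tilde{x}}(k-1-i,\,k-1-j/N-2) - K_{N-1}^{N-k+i}\,E_{N-2}\{\tilde{h}[x(N-1)/N-2]\,\tilde{x}^T(k-1-j/N-2)\}$$

and

$$V_{\tilde{x}}(k,\,N/N-1) = E_{N-1}\{\tilde{x}(k/N-1) \sum_{i=0}^{L} \tilde{f}_i^T[x(N-1-i)/N-1]\}$$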


4.2.6  The  Nonlinear  Filtering 

The nonlinear filtering algorithm can be easily obtained from
Equations (4.2.10), (4.2.11), (4.2.16) - (4.2.19), by setting $\ell = 0$.

$$\hat{x}(k+1/k+1) = \hat{x}(k+1/k) + K_{k+1}^{0}\{y(k+1) - \hat{h}[x(k+1),\,k+1/k]\}$$

$$\hat{x}(k+1/k) = \sum_{j=0}^{L} \hat{f}_j[x(k-j),\,k-j/k]$$

$$K_{k+1}^{0} = E_k\{\tilde{x}(k+1/k)\,\tilde{h}^T[x(k+1)/k]\}\,[\Psi_v(k+1) + E_k\{\tilde{h}[x(k+1)/k]\,\tilde{h}^T[x(k+1)/k]\}]^{-1}$$

and

$$V_{\tilde{x}}(k+1/k+1) = V_{\tilde{x}}(k+1/k) - K_{k+1}^{0}\,E_k\{\tilde{h}[x(k+1)/k]\,\tilde{x}^T(k+1/k)\}$$

whereas $V_{\tilde{x}}(k+1/k)$ is given by Equation (4.2.29).
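As an illustration of the filtering equations above, the measurement update can be written out for the scalar cubic measurement $h[x] = x^3$ treated in Section 4.2.2. This is only a sketch under the Gaussian error assumption: the function name and arguments are hypothetical, and the predicted quantities `x_pred` and `v_pred` are assumed to have been computed beforehand from the prediction equations.

```python
def cubic_filter_update(x_pred, v_pred, y, psi_v):
    """One filtering measurement update (lag 0) for y = x**3 + v.

    x_pred : predicted estimate x(k+1/k)
    v_pred : predicted error variance V(k+1/k)
    y      : measurement y(k+1)
    psi_v  : measurement-noise variance (must be positive)
    """
    # Predicted measurement E_k{x^3} under the Gaussian assumption.
    h_hat = x_pred ** 3 + 3.0 * v_pred * x_pred
    # Cross term E_k{x_err * h_err}, i.e. the smoothing result with
    # the cross-covariance replaced by the variance itself.
    cross = 3.0 * v_pred * x_pred ** 2 + 3.0 * v_pred ** 2
    # Variance of the measurement-function error, E_k{h_err^2}.
    h_var = (15.0 * v_pred ** 3
             + 36.0 * v_pred ** 2 * x_pred ** 2
             + 9.0 * v_pred * x_pred ** 4)
    gain = cross / (psi_v + h_var)
    x_filt = x_pred + gain * (y - h_hat)
    v_filt = v_pred - gain * cross
    return x_filt, v_filt, gain


x_filt, v_filt, gain = cubic_filter_update(x_pred=1.0, v_pred=0.5, y=3.0, psi_v=1.0)
print(x_filt, v_filt, gain)
```

Because the cross term and the variance term are true second moments of a Gaussian error, the Cauchy-Schwarz inequality together with a positive `psi_v` keeps the updated variance positive and no larger than the predicted one.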

4.2.7  Estimation  in  Linear  Discrete  Systems 

In order to provide an insight into the structure of the
smoothed estimate, consider the particular case of linear systems and
measurements with

$$\sum_{j=0}^{L} f_j[x(k-j),\,k-j] = \sum_{j=0}^{L} F_j(k)\,x(k-j)$$

$$h[x(k),\,k] = H(k)\,x(k)$$

and

$$G[x(k),\,k] = G(k)$$

Then the linear fixed-lag smoothing algorithm can be directly
obtained from Equations (4.2.23) to (4.2.28), respectively, as the
following

$$\hat{x}(k-N/k) = \hat{x}(k-N/k-1) + K_k^{N}\{y(k) - H(k)\,\hat{x}(k/k-1)\}$$

$$\hat{x}(k-N/k-1) = \sum_{j=0}^{L} F_j(k-N-1)\,\hat{x}(k-N-1-j/k-1)$$

$$K_k^{N} = V_{\tilde{x}}(k-N,\,k/k-1)\,H^T(k)\,\{H(k)\,V_{\tilde{x}}(k/k-1)\,H^T(k) + \Psi_v(k)\}^{-1}$$

$$V_{\tilde{x}}(k-N/k) = V_{\tilde{x}}(k-N/k-1) - K_k^{N}\,H(k)\,V_{\tilde{x}}(k,\,k-N/k-1)$$

$$V_{\tilde{x}}(k-N/k-1) = \sum_{i=0}^{L}\sum_{j=0}^{L} F_i(k-N-1)\,V_{\tilde{x}}(k-1-N-i,\,k-1-N-j/k-1)\,F_j^T(k-N-1) + G(k-1-N)\,\Psi_w(k-1-N)\,G^T(k-1-N)$$

$$V_{\tilde{x}}(k-1-N-i,\,k-1-N-j/k-1) = V_{\tilde{x}}(k-1-N-i,\,k-1-N-j/k-2) - K_{k-1}^{N+i}\,[H(k-1)\,V_{\tilde{x}}(k-1,\,k-1-N-j/k-2)]$$

Also

$$V_{\tilde{x}}(k-N,\,k+1/k) = \sum_{j=0}^{L} V_{\tilde{x}}(k-N,\,k-j/k)\,F_j^T(k)$$

and finally,

$$V_{\tilde{x}}(k+1/k) = \sum_{i=0}^{L}\sum_{j=0}^{L} F_i(k)\,V_{\tilde{x}}(k-i,\,k-j/k)\,F_j^T(k) + G(k)\,\Psi_w(k)\,G^T(k)$$

It can be easily identified that the above linear fixed-lag smoother
is simply the stable fixed-lag smoother
introduced by Priemer and Vacroux [38]. In the same manner, one can
identify the linear fixed-point smoother for linear systems without
delays with that of Biswas and Mahalanabis [55]. Similarly, the results
presented in Sections 4.2.5 and 4.2.6 can be easily applied to yield the
fixed-interval smoothing and the filtering algorithms, respectively.

Thus, a unified approach to obtain the filtering and smoothing
algorithms for linear as well as nonlinear, delayed as well as non-
delayed systems has been presented.

4.3  NONLINEAR  DISCRETE  DELAYED  SYSTEMS  WITH  MEASUREMENTS  CONTAINING 
MULTIPLE  DELAYS 

In  this  section,  nonlinear  smoothing  algorithms  are  derived 
for  discrete  nonlinear  delayed  systems  with  multichannel  measurements 
containing  time  delays,  imbedded  in  additive  white  noise  processes. 

Section 4.3.1 presents the problem statement. Section 4.3.2
presents the derivation of the nonlinear smoother. Section
4.3.3 gives an illustrative example to show how one can easily obtain
the fixed-lag, fixed-point and fixed-interval smoothing algorithms as
well as the filtering algorithm. It also illustrates the use of the
presented algorithms to arrive at physically realizable nonlinear
estimators. Reference equations for the various smoothing and filtering
algorithms are not presented, since they can be easily obtained from
Section 4.3.2 by properly substituting the discrete time indices.

4.3.1  The  Problem  Statement 

Consider the discrete nonlinear message model of Equation
(4.2.1) with the multichannel measurements given by

$$y(k) = \sum_{j=0}^{N} h_j[x(k-j),\,k-j] + v(k) \tag{4.3.1}$$

where $h_j$ is an m-dimensional nonlinear vector-valued function. All
other prior statistics and initial conditions follow those of Section 4.2.1.

The smoothing problem is to obtain the unbiased smoothing
estimate $\hat{x}(k-\ell+1/k+1)$ of the delayed state $x(k-\ell+1)$, where $0 \le \ell \le k+1$,
conditioned on the measurements

$$Y(k+1) = \{y(0),\, y(1),\, \ldots,\, y(k+1)\}$$

such that the cost function

$$J(k+1) = \mathrm{Trace}\,[M(k)\,V_{\tilde{x}}(k-\ell+1/k+1)] \tag{4.3.2}$$

is minimized, where M(k) is also a non-negative definite weighting
matrix.

Here, a different cost function is purposefully chosen. In
outward appearance, the cost function of Equation (4.2.3) is different
from that of (4.3.2). In essence, however, the two cost functions are
the same: since M(k) is known, it can be shown that minimizing
Equation (4.2.3) is the same as minimizing (4.3.2), and is also
equivalent to minimizing the trace of $V_{\tilde{x}}(k-\ell+1/k+1)$ [49].

4.3.2  The  Derivation  of  the  Smoother 

With reasoning similar to that of Section 4.2.2, the smoothing
algorithm is assumed to be constrained by the following dynamic
equation*

$$\hat{x}(k-\ell+1/k+1) = \sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),\,k-\ell-j] + K_{k+1}^{\ell}\,y(k+1) \tag{4.3.3}$$

* It can be shown that other dynamic equations such as

$$\hat{x}(k-\ell+1/k+1) = \sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),\,k-\ell-j] + K_{k+1}^{\ell}\,\tilde{y}(k+1/k)$$

where $\tilde{y}(k+1/k) = y(k+1) - E[y(k+1)/Y(k)]$

would result in exactly the same smoothing algorithm.


Now the problem is to determine the time-varying nonlinear
vector-valued function $\sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),\,k-\ell-j]$ and the gain algorithm
$K_{k+1}^{\ell}$ such that the trace of $V_{\tilde{x}}(k-\ell+1/k+1)$ is minimized.

Let $\tilde{x}(k-\ell+1/k+1)$ denote the smoothing error defined by

$$\tilde{x}(k-\ell+1/k+1) = x(k-\ell+1) - \hat{x}(k-\ell+1/k+1) \tag{4.3.4}$$

then one can obtain

$$\begin{aligned}
\tilde{x}(k-\ell+1/k+1) ={}& \sum_{j=0}^{L} f_j[x(k-\ell-j),\,k-\ell-j] + G[x(k-\ell),\,k-\ell]\,w(k-\ell) \\
&- \sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),\,k-\ell-j] \\
&- K_{k+1}^{\ell}\{\sum_{j=0}^{N} h_j[x(k-j+1),\,k-j+1] + v(k+1)\}
\end{aligned}$$
j=o  J 


y\ 


In order that $\hat{x}(k-\ell+1/k+1)$ is an unbiased smoothing estimate, it is
necessary that

$$\sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),\,k-\ell-j] = \sum_{j=0}^{L} \hat{f}_j[x(k-\ell-j),\,k-\ell-j/k] - K_{k+1}^{\ell} \sum_{j=0}^{N} \hat{h}_j[x(k-j+1),\,k-j+1/k] \tag{4.3.5}$$

where

$$\hat{f}_j[x(k-\ell-j),\,k-\ell-j/k] = E_k\{f_j[x(k-\ell-j),\,k-\ell-j]\}$$

and

$$\hat{h}_j[x(k-j+1),\,k-j+1/k] = E_k\{h_j[x(k-j+1),\,k-j+1]\}$$

Substituting Equation (4.3.5) into (4.3.3), the smoothing
algorithm becomes

$$\hat{x}(k-\ell+1/k+1) = \hat{x}(k-\ell+1/k) + K_{k+1}^{\ell}\{y(k+1) - \sum_{j=0}^{N} \hat{h}_j[x(k-j+1),\,k-j+1/k]\} \tag{4.3.6}$$

for k = 0, 1, 2, ..., and $0 \le \ell \le k+1$, where

$$\hat{x}(k-\ell+1/k) = \sum_{j=0}^{L} \hat{f}_j[x(k-\ell-j),\,k-\ell-j/k] \tag{4.3.7}$$

Substitution of Equation (4.3.6) into (4.3.4) yields

$$\tilde{x}(k-\ell+1/k+1) = \tilde{x}(k-\ell+1/k) - K_{k+1}^{\ell}\{\sum_{j=0}^{N} \tilde{h}_j[x(k-j+1)/k] + v(k+1)\} \tag{4.3.8}$$

where

$$\tilde{h}_j[x(k-j+1)/k] = h_j[x(k-j+1),\,k-j+1] - \hat{h}_j[x(k-j+1),\,k-j+1/k]$$

Therefore, the smoothing error-variance equation is given by

$$\begin{aligned}
V_{\tilde{x}}(k-\ell+1/k+1) ={}& V_{\tilde{x}}(k-\ell+1/k) - E_k\{\tilde{x}(k-\ell+1/k) \sum_{j=0}^{N} \tilde{h}_j^T[x(k-j+1)/k]\}\,K_{k+1}^{\ell T} \\
&- K_{k+1}^{\ell}\,E_k\{\sum_{j=0}^{N} \tilde{h}_j[x(k-j+1)/k]\,\tilde{x}^T(k-\ell+1/k)\} + K_{k+1}^{\ell}\,\Psi_v(k+1)\,K_{k+1}^{\ell T} \\
&+ K_{k+1}^{\ell}\,E_k\{\sum_{i=0}^{N}\sum_{j=0}^{N} \tilde{h}_i[x(k-i+1)/k]\,\tilde{h}_j^T[x(k-j+1)/k]\}\,K_{k+1}^{\ell T}
\end{aligned} \tag{4.3.9}$$

The necessary condition for minimizing the trace of
$V_{\tilde{x}}(k-\ell+1/k+1)$ is provided by the matrix minimum principle [7], for
which $K_{k+1}^{\ell}$ is considered as the control variable, and is obtained
from the condition

$$\frac{\partial}{\partial K_{k+1}^{\ell}}\,\mathrm{Trace}\,[V_{\tilde{x}}(k-\ell+1/k+1)] = [0]$$

Hence, the gain algorithm is given by

$$K_{k+1}^{\ell} = E_k\{\tilde{x}(k-\ell+1/k) \sum_{j=0}^{N} \tilde{h}_j^T[x(k-j+1)/k]\}\,[\Psi_v(k+1) + E_k\{\sum_{i=0}^{N}\sum_{j=0}^{N} \tilde{h}_i[x(k-i+1)/k]\,\tilde{h}_j^T[x(k-j+1)/k]\}]^{-1} \tag{4.3.10}$$

and Equation (4.3.9) is reduced to


$$V_{\tilde{x}}(k-\ell+1/k+1) = V_{\tilde{x}}(k-\ell+1/k) - K_{k+1}^{\ell}\,E_k\{\sum_{j=0}^{N} \tilde{h}_j[x(k-j+1)/k]\,\tilde{x}^T(k-\ell+1/k)\} \tag{4.3.11}$$

Using Equations (4.2.1) and (4.3.7), there is the relation

$$\tilde{x}(k-\ell+1/k) = \sum_{j=0}^{L} \tilde{f}_j[x(k-\ell-j)/k] + G[x(k-\ell),\,k-\ell]\,w(k-\ell)$$

where

$$\tilde{f}_j[x(k-\ell-j)/k] = f_j[x(k-\ell-j),\,k-\ell-j] - \hat{f}_j[x(k-\ell-j),\,k-\ell-j/k]$$

Then the following recursive smoothing algorithms can also be
derived

$$V_{\tilde{x}}(k-\ell+1/k) = \sum_{i=0}^{L}\sum_{j=0}^{L} E_k\{\tilde{f}_i[x(k-\ell-i)/k]\,\tilde{f}_j^T[x(k-\ell-j)/k]\} + G[x(k-\ell),\,k-\ell]\,\Psi_w(k-\ell)\,G^T[x(k-\ell),\,k-\ell] \tag{4.3.12}$$

and

$$V_{\tilde{x}}(k-\ell+1,\,k-m+1/k+1) = V_{\tilde{x}}(k-\ell+1,\,k-m+1/k) - K_{k+1}^{\ell}\,E_k\{\sum_{j=0}^{N} \tilde{h}_j[x(k-j+1)/k]\,\tilde{x}^T(k-m+1/k)\} \tag{4.3.13}$$

for $0 \le \ell,\ m \le k+1$.

For k < 0, there are no inputs to the smoother and therefore
$\hat{x}(-j/-1)$ is set to zero for j = 0, 1, ..., L, which in turn leads to

$$\tilde{x}(-j/-1) = x(-j)$$

and

$$V_{\tilde{x}}(-j,\,-\ell/-1) = V_x(j,\ell)$$

for $j, \ell = 0, 1, \ldots, L$.

As shown in Section 4.2, the fixed-lag, fixed-point and fixed-
interval smoothing algorithms and the filtering algorithm, together with
their error-variance algorithms, follow immediately from Equations (4.3.6)
and (4.3.10) to (4.3.13) by properly substituting the indices k and $\ell$.
In the particular case of linear message and measurement models,
the results presented for the fixed-lag smoothing estimate can be shown
to be equivalent to those of Shukla and Srinath [56], whereas for linear
problems with delays appearing only in the message model, the above
algorithms agree with those of Priemer and Vacroux [37, 38].

Also notice that in the case of nonlinear systems, the
preceding algorithms require infinite-dimensional systems to realize
the various terms involving the expectations.

In order that the smoother can be physically realized, it is
assumed that the conditional probability density functions of the
smoothing errors are Gaussian, and Equation (4.3.4) is used to replace
the state variables by the sum of their estimates and smoothing
errors. This is particularly significant for systems with product-type
or polynomial nonlinearities, since in such cases the smoothing
algorithms can be evaluated without much difficulty and are physically
realizable without any further approximation.

On the other hand, the majority of other finite-dimensional
filters require a Taylor series expansion of the system
nonlinearities, which in general leads to the deletion of some of the
higher-order terms of the estimators and the error covariances.

Moreover, the smoothing estimators are already required
to be unbiased; therefore, the expectations in Equations
(4.3.10) to (4.3.13) can be further simplified as

$$E_k\{\tilde{x}(k-\ell+1/k) \sum_{j=0}^{N} \tilde{h}_j^T[x(k-j+1)/k]\} = E_k\{\tilde{x}(k-\ell+1/k) \sum_{j=0}^{N} h_j^T[x(k-j+1),\,k-j+1]\} \tag{4.3.14}$$

$$E_k\{\sum_{i,j=0}^{N} \tilde{h}_i[x(k-i+1)/k]\,\tilde{h}_j^T[x(k-j+1)/k]\} = E_k\{\sum_{i,j=0}^{N} h_i[x(k-i+1),\,k-i+1]\,h_j^T[x(k-j+1),\,k-j+1]\} - \sum_{i,j=0}^{N} \hat{h}_i[x(k-i+1),\,k-i+1/k]\,\hat{h}_j^T[x(k-j+1),\,k-j+1/k] \tag{4.3.15}$$

and

$$E_k\{\sum_{i,j=0}^{L} \tilde{f}_i[x(k-\ell-i)/k]\,\tilde{f}_j^T[x(k-\ell-j)/k]\} = E_k\{\sum_{i,j=0}^{L} f_i[x(k-\ell-i),\,k-\ell-i]\,f_j^T[x(k-\ell-j),\,k-\ell-j]\} - \sum_{i,j=0}^{L} \hat{f}_i[x(k-\ell-i),\,k-\ell-i/k]\,\hat{f}_j^T[x(k-\ell-j),\,k-\ell-j/k] \tag{4.3.16}$$

4.3.3 Illustrative Example and Discussion

In order to illustrate the use of the preceding algorithms, consider
the following scalar nonlinear smoothing problem

\[
\begin{aligned}
x(k+1) &= x^{3}(k) + 2x(k-1) + w(k) \\
y(k) &= x^{2}(k) + x(k-1) + v(k)
\end{aligned}
\]

where $w(k)$ and $v(k)$ are independent Gaussian noise processes with
variances $\Psi_w(k)$ and $\Psi_v(k)$, respectively. The initial
conditions $x(0)$ and $x(-1)$ are zero-mean Gaussian with respective
moments $\hat{x}(0/0)$ and $\hat{x}(-1/0)$, and are independent of both
$v(k)$ and $w(k)$, and also

\[
E\{x(-j)\,x^{T}(-\ell)\} = V_x(j,\ell)
\]

for $j, \ell = 0, 1$.

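The expectations of Equations (4.3.14) to (4.3.16) can be obtained,
respectively, as follows:

\[
\begin{aligned}
E_k\{\tilde{x}(k-\ell+1/k)\,[x^{2}(k+1) + x(k)]\}
 &= E_k\{\tilde{x}(k-\ell+1/k)\,[\hat{x}^{2}(k+1/k) + 2\hat{x}(k+1/k)\,\tilde{x}(k+1/k) \\
 &\qquad + \tilde{x}^{2}(k+1/k) + \hat{x}(k/k) + \tilde{x}(k/k)]\} \\
 &= 2V_{\tilde{x}}(k-\ell+1,\,k+1/k)\,\hat{x}(k+1/k) + V_{\tilde{x}}(k-\ell+1,\,k/k)
\end{aligned}
\tag{4.3.17}
\]

\[
\begin{aligned}
E_k\{[x^{2}(k+1) &+ x(k)]^{2}\} - [E_k\{x^{2}(k+1) + x(k)\}]^{2} \\
 &= E_k\{x^{4}(k+1) + 2x^{2}(k+1)\,x(k) + x^{2}(k)\}
    - [V_{\tilde{x}}(k+1/k) + \hat{x}^{2}(k+1/k) + \hat{x}(k/k)]^{2} \\
 &= 3V_{\tilde{x}}^{2}(k+1/k) + 6\hat{x}^{2}(k+1/k)\,V_{\tilde{x}}(k+1/k) + \hat{x}^{4}(k+1/k) \\
 &\quad + 2[2V_{\tilde{x}}(k+1,\,k/k)\,\hat{x}(k+1/k) + V_{\tilde{x}}(k+1/k)\,\hat{x}(k/k)
    + \hat{x}^{2}(k+1/k)\,\hat{x}(k/k)] \\
 &\quad + V_{\tilde{x}}(k/k) + \hat{x}^{2}(k/k)
    - [V_{\tilde{x}}(k+1/k) + \hat{x}^{2}(k+1/k) + \hat{x}(k/k)]^{2} \\
 &= 2V_{\tilde{x}}^{2}(k+1/k) + 4V_{\tilde{x}}(k+1/k)\,\hat{x}^{2}(k+1/k)
    + 4V_{\tilde{x}}(k+1,\,k/k)\,\hat{x}(k+1/k) + V_{\tilde{x}}(k/k)
\end{aligned}
\tag{4.3.18}
\]

and

\[
\begin{aligned}
E_k\{[x^{3}(k-\ell) &+ 2x(k-\ell-1)]^{2}\} - [E_k\{x^{3}(k-\ell) + 2x(k-\ell-1)\}]^{2} \\
 &= E_k\{x^{6}(k-\ell) + 4x^{3}(k-\ell)\,x(k-\ell-1) + 4x^{2}(k-\ell-1)\} \\
 &\quad - [3V_{\tilde{x}}(k-\ell/k)\,\hat{x}(k-\ell/k) + \hat{x}^{3}(k-\ell/k) + 2\hat{x}(k-\ell-1/k)]^{2} \\
 &= 15V_{\tilde{x}}^{3}(k-\ell/k) + 36V_{\tilde{x}}^{2}(k-\ell/k)\,\hat{x}^{2}(k-\ell/k)
    + 9V_{\tilde{x}}(k-\ell/k)\,\hat{x}^{4}(k-\ell/k) \\
 &\quad + 12V_{\tilde{x}}(k-\ell,\,k-\ell-1/k)\,V_{\tilde{x}}(k-\ell/k)
    + 12V_{\tilde{x}}(k-\ell,\,k-\ell-1/k)\,\hat{x}^{2}(k-\ell/k) \\
 &\quad + 4V_{\tilde{x}}(k-\ell-1/k)
\end{aligned}
\tag{4.3.19}
\]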

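Then Equations (4.3.6), (4.3.10) to (4.3.13) can be obtained with the
substitution of Equations (4.3.17) to (4.3.19). In particular, when
$\ell = 0$, Equations (4.3.6), (4.3.10) to (4.3.13) directly yield the
nonlinear filtering algorithms. Namely,

\[
\hat{x}(k+1/k+1) = \hat{x}(k+1/k) + K_{k+1}^{0}\{y(k+1) - V_{\tilde{x}}(k+1/k)
 - \hat{x}^{2}(k+1/k) - \hat{x}(k/k)\}
\]

where

\[
\hat{x}(k+1/k) = 3V_{\tilde{x}}(k/k)\,\hat{x}(k/k) + \hat{x}^{3}(k/k) + 2\hat{x}(k-1/k)
\]

\[
\begin{aligned}
K_{k+1}^{0} &= [2V_{\tilde{x}}(k+1/k)\,\hat{x}(k+1/k) + V_{\tilde{x}}(k+1,\,k/k)] \\
 &\quad \cdot [\Psi_v(k+1) + 2V_{\tilde{x}}^{2}(k+1/k) + 4V_{\tilde{x}}(k+1/k)\,\hat{x}^{2}(k+1/k) \\
 &\qquad + 4V_{\tilde{x}}(k+1,\,k/k)\,\hat{x}(k+1/k) + V_{\tilde{x}}(k/k)]^{-1}
\end{aligned}
\]

Also,

\[
V_{\tilde{x}}(k+1/k+1) = V_{\tilde{x}}(k+1/k)
 - K_{k+1}^{0}[2V_{\tilde{x}}(k+1/k)\,\hat{x}(k+1/k) + V_{\tilde{x}}(k+1,\,k/k)]
\]

and

\[
\begin{aligned}
V_{\tilde{x}}(k+1/k) &= 15V_{\tilde{x}}^{3}(k/k) + 36V_{\tilde{x}}^{2}(k/k)\,\hat{x}^{2}(k/k)
 + 9V_{\tilde{x}}(k/k)\,\hat{x}^{4}(k/k) \\
 &\quad + 12V_{\tilde{x}}(k,\,k-1/k)\,V_{\tilde{x}}(k/k)
 + 12V_{\tilde{x}}(k,\,k-1/k)\,\hat{x}^{2}(k/k) \\
 &\quad + 4V_{\tilde{x}}(k-1/k) + \Psi_w(k)
\end{aligned}
\]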

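The fixed-lag smoothing algorithms are obtained from Equations
(4.3.8), (4.3.10) to (4.3.13), with the substitution of $k$ and $N$ for
$k+1$ and $\ell$, respectively.

\[
\hat{x}(k-N/k) = \hat{x}(k-N/k-1) + K_{k}^{N}\{y(k) - V_{\tilde{x}}(k/k-1)
 - \hat{x}^{2}(k/k-1) - \hat{x}(k-1/k-1)\}
\]

where

\[
\hat{x}(k-N/k-1) = 3V_{\tilde{x}}(k-1-N/k-1)\,\hat{x}(k-1-N/k-1)
 + \hat{x}^{3}(k-1-N/k-1) + 2\hat{x}(k-2-N/k-1)
\]

and

\[
\begin{aligned}
K_{k}^{N} &= [2V_{\tilde{x}}(k-N,\,k/k-1)\,\hat{x}(k/k-1) + V_{\tilde{x}}(k-N,\,k-1/k-1)] \\
 &\quad \cdot [\Psi_v(k) + 2V_{\tilde{x}}^{2}(k/k-1) + 4V_{\tilde{x}}(k/k-1)\,\hat{x}^{2}(k/k-1) \\
 &\qquad + 4V_{\tilde{x}}(k,\,k-1/k-1)\,\hat{x}(k/k-1) + V_{\tilde{x}}(k-1/k-1)]^{-1}
\end{aligned}
\]

Also

\[
V_{\tilde{x}}(k-N/k) = V_{\tilde{x}}(k-N/k-1)
 - K_{k}^{N}[2V_{\tilde{x}}(k-N,\,k/k-1)\,\hat{x}(k/k-1) + V_{\tilde{x}}(k-N,\,k-1/k-1)]
\]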


In a similar manner, the fixed-interval as well as the fixed-point
smoothing algorithms can be obtained from Equations (4.3.6), (4.3.10)
to (4.3.13), (4.3.17) to (4.3.19) by properly substituting the indices
$k$ and $\ell$. As a consequence, the effectiveness of the proposed
algorithms is illustrated.


Notice that in the above evaluation, the only assumption needed is the
Gaussian assumption on the estimator error. It can be easily shown that
if the system nonlinearities were expanded in a first- or second-order
Taylor series, the higher order terms of the error-covariances as well
as the estimators would have been dropped, which, when applied to
estimation problems with relatively large initial error covariances,
may lead to erroneous results.

To qualitatively assess the performance characteristics of the
proposed nonlinear estimation algorithms, extensive numerical
simulation and comparative studies must be carried out to compare the
above estimators with those of other finite dimensional filters;
however, this is not included in this section.

4.4  CONCLUSION 

In this chapter, discrete estimation algorithms are derived for
nonlinear multiple delayed systems with single-channel non-delayed
measurement as well as measurements with multiple delays, imbedded in
additive white noise processes. The results are new, recursive in
nature and directly yield the fixed-interval, fixed-point, fixed-lag
smoothing and the filtering algorithms. The derivation is
straightforward and clearly shows the close link between the smoothing
and filtering estimation algorithms.

The presented algorithms are physically realizable under the
assumption that the probability density functions of the estimator
errors are Gaussian; they can be obtained without any further
approximation for systems with product-type or polynomial
nonlinearities. They are expected to be computationally faster than
applying an extended Kalman filter to an augmented system with no
explicit delay times.


In the particular case of linear system models, the smoothing
algorithms are stable. For the nonlinear case the stability behavior
needs to be further investigated; however, the basic structure of the
nonlinear estimator is the same as that of its linear counterpart,
hence there is good reason to believe that the corresponding nonlinear
estimator will be stable provided the original message model is also
stable.

The  results  derived  in  this  thesis  can  be  extended  to 
continuous  time  problems  through  a  formal  limiting  procedure  [4]. 

The  presented  approach  can  also  be  extended  to  nonlinear  distributive 
systems  with  or  without  delays. 




CHAPTER  V 


MINIMUM  VARIANCE  FILTERING  AND  SMOOTHING  FOR  NONLINEAR 
SYSTEMS  WITH  CORRELATED  NOISES 

5.1 INTRODUCTION

In Chapter IV, the noise processes considered are assumed to be
Gaussian white and mutually independent. In practical situations,
however, such assumptions are invalid, since in physical systems
independent white noise processes simply do not exist. Therefore, it
is natural to extend the new estimation technique developed in
Chapter IV to more realistic problems, where message noise processes
are correlated with measurement noise processes.

Linear estimation problems for systems without delays imbedded in
correlated noise processes can be handled by the uncorrelated
algorithms if one can uncorrelate the message and measurement noise
processes. Recently, Raja Rao and Mahalanabis [57] derived estimation
algorithms for linear systems with delay imbedded in correlated noise
processes; however, their results appear to have a fundamental mistake
in the procedure given, which leads to self-contradictory results;
this is reported in [58].

To the author's knowledge, estimation algorithms have not been derived
for nonlinear systems with or without delays imbedded in correlated
noise processes. This is due to the inherent difficulty in finding the
conditional-mean estimate equation and the associated error-variance
when the noise processes are correlated.

In Section 5.2, new recursive estimation algorithms are derived for
discrete nonlinear multiple-delay systems with non-delayed
measurements imbedded in correlated noises. Handy sets of reference
equations are provided for each specific case of smoothing and also
for the filtering estimation; general linear estimation algorithms are
also provided.

In  Section  5.3,  new  recursive  estimation  algorithms  are 
derived  for  discrete  nonlinear  delayed  systems  with  multi-channel 
measurements  involving  time  delays  and  imbedded  in  correlated  noise 
processes.  An  example  is  given  to  illustrate  the  use  of  the  proposed 
smoothing  algorithm,  and  also  derived  are  the  general  linear  estimation 
algorithms. 

In  this  chapter,  the  basic  technique  makes  use  of  the  matrix 
minimum  principle  to  derive  the  optimal  values  of  the  coefficients 
in  the  estimation  algorithms  under  the  requirements  that  the  estimates 
be  unbiased  and  the  error-variance,  which  is  taken  to  be  the  estimation 
criterion,  is  minimized;  they  are  reported  in  [59,60]. 

5.2 NONLINEAR DISCRETE DELAYED SYSTEMS WITH NON-DELAYED MEASUREMENTS
IMBEDDED IN CORRELATED NOISE PROCESSES

In this section, estimation algorithms are derived for nonlinear
discrete delayed systems with non-delayed measurements imbedded in
correlated noise processes.

Section 5.2.1 presents the problem statement. Section 5.2.2 deals with
the derivation of the smoother. In Sections 5.2.3 to 5.2.6, nonlinear
estimators together with their error-variance equations are obtained
for the fixed-lag, fixed-point, fixed-interval smoothing and the
filtering estimation, respectively. Section 5.2.7 presents the general
smoothing algorithms for linear discrete delayed systems, which can be
easily converted to yield the fixed-lag, fixed-point, fixed-interval
smoothing and the filtering algorithms, by properly substituting the
time indices.

5.2.1  The  Problem  Statement 

The state equation of the discrete nonlinear time-delayed systems
under consideration is described by

\[
x(k+1) = \sum_{j=0}^{L} f_j[x(k-j),\,k-j] + G[x(k),\,k]\,w(k)
\tag{5.2.1}
\]

and the measurement is given by

\[
y(k) = h[x(k),\,k] + v(k)
\tag{5.2.2}
\]

where $k = 0, 1, 2, \ldots$ is the discrete time index; the state $x$
is an $n$-vector; the measurement $y$ an $m$-vector; the state noise
sequence $w$ an $r$-vector; the measurement noise $v$ an $m$-vector;
and $G$ a nonlinear state dependent $n \times r$ matrix. The nonlinear
vector valued functions $f_j$ and $h$ are, respectively, $n$ and $m$
dimensional.

The noise sequences $w$ and $v$ are zero-mean white Gaussian with
non-negative definite covariance $\Psi_w$ and positive definite
covariance $\Psi_v$, respectively. Also

\[
E\{v(k)\,w^{T}(j)\} = \Psi_{vw}(k)\,\delta_{k,j}
\]

for all integers $k$ and $j$, where $E$ denotes the expected value,
$\delta_{k,j}$ the Kronecker delta, and $\Psi_{vw}$ is non-negative
definite.

The initial states $x(0)$ and $x(-j)$, for $j = 1, \ldots, L$, are
zero-mean Gaussian random vectors, which are independent of $v(k)$ and
$w(k)$, with a positive definite covariance matrix

\[
E\{x(-j)\,x^{T}(-\ell)\} = V_x(j,\ell)
\]

for $j, \ell = 0, 1, \ldots, L$.
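For simulation studies, a pair of noise samples with the cross-correlation assumed above has to be generated. The following is a minimal sketch for the scalar case only, assuming $\Psi_w > 0$; it builds $v$ as a scaled copy of $w$ plus an independent Gaussian term. The function names and the construction itself are illustrative assumptions and do not appear in the original text.

```python
import random


def correlated_noise_coeffs(psi_w, psi_v, psi_vw):
    """Return (a, var_e) such that v = a * w + e has the required statistics.

    With w ~ N(0, psi_w) and e ~ N(0, var_e) independent of w:
    E{v w} = a * psi_w = psi_vw and E{v v} = a**2 * psi_w + var_e = psi_v.
    """
    a = psi_vw / psi_w
    var_e = psi_v - psi_vw ** 2 / psi_w
    if var_e < 0.0:
        raise ValueError("psi_vw**2 must not exceed psi_w * psi_v")
    return a, var_e


def sample_noise(psi_w, psi_v, psi_vw, rng):
    """Draw one pair (w(k), v(k)) of scalar correlated Gaussian samples."""
    a, var_e = correlated_noise_coeffs(psi_w, psi_v, psi_vw)
    w = rng.gauss(0.0, psi_w ** 0.5)
    v = a * w + rng.gauss(0.0, var_e ** 0.5)
    return w, v
```

A call such as `sample_noise(2.0, 1.0, 0.5, random.Random(0))` returns one $(w, v)$ pair; repeating the call with the same generator gives a white sequence, since successive draws are independent.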

The smoothing problem is to obtain the unbiased smoothed estimate
$\hat{x}(k-\ell+1/k+1)$ of the state $x(k-\ell+1)$, where
$0 \le \ell \le k+1$, conditioned on the set of measurements

\[
Y(k+1) = \{y(0),\,y(1),\,\ldots,\,y(k+1)\}
\]

such that the cost function

\[
J(k+1) = \mathrm{Trace}[M(k)\,V_{\tilde{x}}(k-\ell+1/k+1)]
\tag{5.2.3}
\]

is a minimum. Here $M(k)$ is some non-negative definite weighting
matrix, and $V_{\tilde{x}}(k-\ell+1/k+1)$ is defined by

\[
V_{\tilde{x}}(k-\ell+1/k+1) = E_{k+1}[\{x(k-\ell+1) - \hat{x}(k-\ell+1/k+1)\}
 \{x(k-\ell+1) - \hat{x}(k-\ell+1/k+1)\}^{T}]
\tag{5.2.4}
\]

where $E_{k+1}\{\cdot\}$ denotes the expectation operation conditioned
on the set of measurements $Y(k+1)$.

5.2.2  The  Derivation  of  the  Smoother 

With reasoning similar to that of Section 4.2.2, the smoothed estimate
is assumed to be constrained by the nonlinear dynamic equation*

\[
\hat{x}(k-\ell+1/k+1) = \sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),\,k-\ell-j]
 + K_{k+1}^{\ell}\,y(k+1)
\tag{5.2.5}
\]

where $\sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),\,k-\ell-j]$ and
$K_{k+1}^{\ell}$ are yet to be determined.

* It can be shown that when another dynamic equation such as

\[
\hat{x}(k-\ell+1/k+1) = \sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),\,k-\ell-j]
 + K_{k+1}^{\ell}\{y(k+1) - E_k[y(k+1)]\}
\]

is assumed, the resulting smoothing estimator is exactly the same as
that obtained using Equation (5.2.5).

Let $\tilde{x}(k-\ell+1/k+1)$ denote the smoothing error defined by

\[
\tilde{x}(k-\ell+1/k+1) = x(k-\ell+1) - \hat{x}(k-\ell+1/k+1)
\]

then we obtain

\[
\begin{aligned}
\tilde{x}(k-\ell+1/k+1) &= \sum_{j=0}^{L} f_j[x(k-\ell-j),\,k-\ell-j]
 + G[x(k-\ell),\,k-\ell]\,w(k-\ell) \\
 &\quad - \sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),\,k-\ell-j]
 - K_{k+1}^{\ell}\{h[x(k+1),\,k+1] + v(k+1)\}
\end{aligned}
\tag{5.2.6}
\]
Since it is required that the smoothed estimate be unbiased, it is
necessary that

\[
\sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),\,k-\ell-j]
 = \sum_{j=0}^{L} \hat{f}_j[x(k-\ell-j),\,k-\ell-j/k]
 - K_{k+1}^{\ell}\,\hat{h}[x(k+1),\,k+1/k]
\tag{5.2.7}
\]

where

\[
\hat{f}_j[x(k-\ell-j),\,k-\ell-j/k] = E_k\{f_j[x(k-\ell-j),\,k-\ell-j]\}
\]

and

\[
\hat{h}[x(k+1),\,k+1/k] = E_k\{h[x(k+1),\,k+1]\}
\]

With Equation (5.2.7) substituted into (5.2.5), the smoothed estimate
becomes

\[
\hat{x}(k-\ell+1/k+1) = \hat{x}(k-\ell+1/k)
 + K_{k+1}^{\ell}\{y(k+1) - \hat{h}[x(k+1),\,k+1/k]\}
\tag{5.2.8}
\]

for $k = 0, 1, 2, \ldots$ and $0 \le \ell \le k+1$, where

\[
\hat{x}(k-\ell+1/k) = \sum_{j=0}^{L} \hat{f}_j[x(k-\ell-j),\,k-\ell-j/k]
\tag{5.2.9}
\]

Equation (5.2.6) is now simplified as

\[
\tilde{x}(k-\ell+1/k+1) = \tilde{x}(k-\ell+1/k)
 - K_{k+1}^{\ell}\{\tilde{h}[x(k+1)/k] + v(k+1)\}
\tag{5.2.10}
\]

with

\[
\tilde{h}[x(k+1)/k] = h[x(k+1),\,k+1] - \hat{h}[x(k+1),\,k+1/k]
\]

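Then, the error-variance equation is given as

\[
\begin{aligned}
V_{\tilde{x}}(k-\ell+1/k+1) &= V_{\tilde{x}}(k-\ell+1/k)
 - K_{k+1}^{\ell}\,E_k\{\tilde{h}[x(k+1)/k]\,\tilde{x}^{T}(k-\ell+1/k)\} \\
 &\quad - E_k\{\tilde{x}(k-\ell+1/k)\,\tilde{h}^{T}[x(k+1)/k]\}\,(K_{k+1}^{\ell})^{T}
 + K_{k+1}^{\ell}\,\Psi_v(k+1)\,(K_{k+1}^{\ell})^{T} \\
 &\quad + K_{k+1}^{\ell}\,E_k\{\tilde{h}[x(k+1)/k]\,\tilde{h}^{T}[x(k+1)/k]\}\,(K_{k+1}^{\ell})^{T}
\end{aligned}
\tag{5.2.11}
\]

The necessary condition for minimizing the trace of
$V_{\tilde{x}}(k-\ell+1/k+1)$ can now be obtained from the condition

\[
\frac{\partial}{\partial K_{k+1}^{\ell}}\,\mathrm{Trace}[V_{\tilde{x}}(k-\ell+1/k+1)] = [0]
\]

Hence, the gain algorithm $K_{k+1}^{\ell}$ is given by

\[
K_{k+1}^{\ell} = E_k\{\tilde{x}(k-\ell+1/k)\,\tilde{h}^{T}[x(k+1)/k]\}\,
 [\Psi_v(k+1) + E_k\{\tilde{h}[x(k+1)/k]\,\tilde{h}^{T}[x(k+1)/k]\}]^{-1}
\tag{5.2.12}
\]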
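The passage from Equation (5.2.11) to the gain (5.2.12) can be illustrated numerically in the scalar case. In the sketch below, `V`, `C` and `S` are hypothetical stand-ins for $V_{\tilde{x}}(k-\ell+1/k)$, $E_k\{\tilde{x}\tilde{h}\}$ and $E_k\{\tilde{h}^{2}\}$; they are treated as given numbers, which is an assumption made for illustration rather than part of the derivation.

```python
def error_variance(K, V, C, S, psi_v):
    """Scalar form of (5.2.11): V - 2*K*C + K**2 * (psi_v + S)."""
    return V - 2.0 * K * C + K * K * (psi_v + S)


def optimal_gain(C, S, psi_v):
    """Scalar form of (5.2.12): K = C / (psi_v + S)."""
    return C / (psi_v + S)
```

With `V = 2.0`, `C = 0.8`, `S = 1.5` and `psi_v = 0.5`, the gain is 0.4 and the error variance at that gain equals `V - K * C`, which is the scalar form of the reduced variance equation. Perturbing the gain in either direction increases the value, as expected of a quadratic in `K` with positive leading coefficient.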


and Equation (5.2.11) becomes

\[
V_{\tilde{x}}(k-\ell+1/k+1) = V_{\tilde{x}}(k-\ell+1/k)
 - K_{k+1}^{\ell}\,E_k\{\tilde{h}[x(k+1)/k]\,\tilde{x}^{T}(k-\ell+1/k)\}
\tag{5.2.13}
\]

and if the matrix $V_{\tilde{x}}(k-\ell+1,\,k-m+1/k+1)$ is defined as

\[
V_{\tilde{x}}(k-\ell+1,\,k-m+1/k+1)
 = E_{k+1}\{\tilde{x}(k-\ell+1/k+1)\,\tilde{x}^{T}(k-m+1/k+1)\}
\]

then

\[
V_{\tilde{x}}(k-\ell+1,\,k-m+1/k+1) = V_{\tilde{x}}(k-\ell+1,\,k-m+1/k)
 - K_{k+1}^{\ell}\,E_k\{\tilde{h}[x(k+1)/k]\,\tilde{x}^{T}(k-m+1/k)\}
\tag{5.2.14}
\]

for $0 \le \ell, m \le k+1$.

Now, there remains the problem of evaluating
$V_{\tilde{x}}(k-\ell+1/k)$ in Equation (5.2.13).

Even though subtracting Equation (5.2.9) from Equation (5.2.1) would
easily yield

\[
\tilde{x}(k-\ell+1/k) = \sum_{j=0}^{L} \tilde{f}_j[x(k-\ell-j)/k]
 + G[x(k-\ell),\,k-\ell]\,w(k-\ell)
\tag{5.2.15}
\]

where it is defined that

\[
\tilde{f}_j[x(k-\ell-j)/k] = f_j[x(k-\ell-j),\,k-\ell-j]
 - \hat{f}_j[x(k-\ell-j),\,k-\ell-j/k]
\]

it is seen that the error-variance equation cannot be obtained by
taking the expectation of Equation (5.2.15) multiplied by its own
transpose, since the expectation of

\[
E_k\{\tilde{f}_j[x(k-\ell-j)/k]\,w^{T}(k-\ell)\}
\]

cannot be explicitly evaluated.

On the other hand, following the estimation technique presented above,
the unbiased estimate of the state $x(k-\ell+1)$ can be obtained as

\[
\hat{x}(k-\ell+1/k) = \sum_{j=0}^{L} \hat{f}_j[x(k-\ell-j),\,k-\ell-j/k-1]
 + \kappa_{k}^{\ell}\{y(k) - \hat{h}[x(k),\,k/k-1]\}
\]

which in turn leads to

\[
\tilde{x}(k-\ell+1/k) = \sum_{j=0}^{L} \tilde{f}_j[x(k-\ell-j)/k-1]
 - \kappa_{k}^{\ell}\{\tilde{h}[x(k)/k-1] + v(k)\}
 + G[x(k-\ell),\,k-\ell]\,w(k-\ell)
\tag{5.2.16}
\]

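The optimal value of the matrix $\kappa_{k}^{\ell}$ can be determined
from the necessary condition that

\[
\frac{\partial}{\partial \kappa_{k}^{\ell}}\,
 \mathrm{trace}[M(k)\,V_{\tilde{x}}(k-\ell+1/k)] = [0]
\tag{5.2.17}
\]

Since $V_{\tilde{x}}(k-\ell+1/k)$ is the expectation of Equation
(5.2.16) multiplied by its own transpose, then using Equation (5.2.17),
the result is

\[
\begin{aligned}
\kappa_{k}^{\ell} &= \Big[E_{k-1}\Big\{\sum_{j=0}^{L}\tilde{f}_j[x(k-\ell-j)/k-1]\,
 \tilde{h}^{T}[x(k)/k-1]\Big\}
 + G[x(k-\ell),\,k-\ell]\,\Psi_{wv}(k)\,\delta_{k,k-\ell}\Big] \\
 &\quad \cdot [\Psi_v(k) + E_{k-1}\{\tilde{h}[x(k)/k-1]\,\tilde{h}^{T}[x(k)/k-1]\}]^{-1}
\end{aligned}
\tag{5.2.18}
\]

and $V_{\tilde{x}}(k-\ell+1/k)$ is simply

\[
\begin{aligned}
V_{\tilde{x}}(k-\ell+1/k) &= \sum_{i,j=0}^{L} E_{k-1}\{\tilde{f}_i[x(k-\ell-i)/k-1]\,
 \tilde{f}_j^{T}[x(k-\ell-j)/k-1]\} \\
 &\quad + G[x(k-\ell),\,k-\ell]\,\Psi_w(k-\ell)\,G^{T}[x(k-\ell),\,k-\ell] \\
 &\quad - \kappa_{k}^{\ell}\Big[\Psi_{vw}(k-\ell)\,G^{T}[x(k-\ell),\,k-\ell]\,\delta_{k,k-\ell} \\
 &\qquad + E_{k-1}\Big\{\tilde{h}[x(k)/k-1]\sum_{j=0}^{L}\tilde{f}_j^{T}[x(k-\ell-j)/k-1]\Big\}\Big]
\end{aligned}
\tag{5.2.19}
\]

also

\[
\begin{aligned}
V_{\tilde{x}}(k-\ell+1,\,k-m+1/k) &= \sum_{i,j=0}^{L} E_{k-1}\{\tilde{f}_i[x(k-\ell-i)/k-1]\,
 \tilde{f}_j^{T}[x(k-m-j)/k-1]\} \\
 &\quad + G[x(k-\ell),\,k-\ell]\,\Psi_w(k-\ell)\,\delta_{k-\ell,k-m}\,G^{T}[x(k-m),\,k-m] \\
 &\quad - \kappa_{k}^{\ell}\Big[\Psi_{vw}(k-m)\,\delta_{k,k-m}\,G^{T}[x(k-m),\,k-m] \\
 &\qquad + E_{k-1}\Big\{\tilde{h}[x(k)/k-1]\sum_{j=0}^{L}\tilde{f}_j^{T}[x(k-m-j)/k-1]\Big\}\Big]
\end{aligned}
\tag{5.2.20}
\]

for $0 \le \ell, m \le k+1$.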

When the discrete time index $k < 0$, there is no input to the
smoother and therefore $\hat{x}(-j/-1)$ is set to zero for
$j = 0, 1, \ldots, L$, which results in

\[
\tilde{x}(-j/-1) = x(-j)
\]

and

\[
V_{\tilde{x}}(-j,\,-\ell/-1) = V_x(j,\ell)
\]

for $j, \ell = 0, 1, \ldots, L$.

On the other hand, since the smoothing estimator is unbiased, the
expectations in Equations (5.2.12) to (5.2.14) and (5.2.18) to (5.2.20)
can be replaced by Equations (4.2.20) to (4.2.22) and also

\[
\begin{aligned}
E_{k-1}\{\tilde{f}_i[x(k-\ell-i)/k-1]\,\tilde{f}_j^{T}[x(k-m-j)/k-1]\}
 &= E_{k-1}\{f_i[x(k-\ell-i),\,k-\ell-i]\,f_j^{T}[x(k-m-j),\,k-m-j]\} \\
 &\quad - \hat{f}_i[x(k-\ell-i),\,k-\ell-i/k-1]\,\hat{f}_j^{T}[x(k-m-j),\,k-m-j/k-1]
\end{aligned}
\tag{5.2.21}
\]

It  must  be  noted  that  in  the  case  of  nonlinear  systems  the 
above  expectations  require  infinite  dimensional  systems  to  realize. 

Therefore, as an approximation, it is assumed that the conditional
probability density functions of the smoothing error $\tilde{x}$ are
Gaussian. It is important to note that under such an assumption, the
algorithms presented in this chapter can be physically realized
without any further approximation for systems with product-type or
polynomial nonlinearities.

The estimation algorithms presented here are not only new and
recursive in nature, but also directly yield the fixed-lag,
fixed-point, fixed-interval smoothing as well as filtering algorithms;
this is done by the proper substitution of indices $\ell$ and $m$.

5.2.3  The  Nonlinear  Fixed-Lag  Smoothing 

When $k$ and $N$ are substituted in place of $k+1$ and $\ell$ in
Equations (5.2.8), (5.2.9), (5.2.12) and (5.2.13) and also Equations
(5.2.18) to (5.2.20), where $k > N$, the results obtained are the
recursive nonlinear fixed-lag smoothing algorithms; they are,
respectively, as follows

\[
\hat{x}(k-N/k) = \hat{x}(k-N/k-1) + K_{k}^{N}\{y(k) - \hat{h}[x(k),\,k/k-1]\}
\]

where

\[
\hat{x}(k-N/k-1) = \sum_{j=0}^{L} \hat{f}_j[x(k-1-N-j),\,k-1-N-j/k-1]
\]

\[
K_{k}^{N} = E_{k-1}\{\tilde{x}(k-N/k-1)\,\tilde{h}^{T}[x(k)/k-1]\}\,
 [\Psi_v(k) + E_{k-1}\{\tilde{h}[x(k)/k-1]\,\tilde{h}^{T}[x(k)/k-1]\}]^{-1}
\]

\[
V_{\tilde{x}}(k-N/k) = V_{\tilde{x}}(k-N/k-1)
 - K_{k}^{N}\,E_{k-1}\{\tilde{h}[x(k)/k-1]\,\tilde{x}^{T}(k-N/k-1)\}
\]

\[
\begin{aligned}
\kappa_{k-1}^{N} &= \Big[E_{k-2}\Big\{\sum_{j=0}^{L}\tilde{f}_j[x(k-1-N-j)/k-2]\,
 \tilde{h}^{T}[x(k-1)/k-2]\Big\} \\
 &\qquad + G[x(k-1-N),\,k-1-N]\,\Psi_{wv}(k-1)\,\delta_{k-1,k-1-N}\Big] \\
 &\quad \cdot [\Psi_v(k-1) + E_{k-2}\{\tilde{h}[x(k-1)/k-2]\,\tilde{h}^{T}[x(k-1)/k-2]\}]^{-1}
\end{aligned}
\]

\[
\begin{aligned}
V_{\tilde{x}}(k-N/k-1) &= \sum_{i,j=0}^{L} E_{k-2}\{\tilde{f}_i[x(k-1-N-i)/k-2]\,
 \tilde{f}_j^{T}[x(k-1-N-j)/k-2]\} \\
 &\quad + G[x(k-1-N),\,k-1-N]\,\Psi_w(k-1-N)\,G^{T}[x(k-1-N),\,k-1-N] \\
 &\quad - \kappa_{k-1}^{N}\Big[\Psi_{vw}(k-1-N)\,G^{T}[x(k-1-N),\,k-1-N]\,\delta_{k-1,k-1-N} \\
 &\qquad + E_{k-2}\Big\{\tilde{h}[x(k-1)/k-2]\sum_{j=0}^{L}\tilde{f}_j^{T}[x(k-1-N-j)/k-2]\Big\}\Big]
\end{aligned}
\]

\[
\begin{aligned}
V_{\tilde{x}}(k-\ell,\,k-m/k-1) &= \sum_{i,j=0}^{L} E_{k-2}\{\tilde{f}_i[x(k-\ell-i-1)/k-2]\,
 \tilde{f}_j^{T}[x(k-m-j-1)/k-2]\} \\
 &\quad + G[x(k-1-\ell),\,k-\ell-1]\,\Psi_w(k-1-\ell)\,\delta_{k-1-\ell,k-1-m}\,
 G^{T}[x(k-1-m),\,k-1-m] \\
 &\quad - \kappa_{k-1}^{\ell}\Big[\Psi_{vw}(k-1-m)\,\delta_{k-1,k-1-m}\,G^{T}[x(k-1-m),\,k-1-m] \\
 &\qquad + E_{k-2}\Big\{\tilde{h}[x(k-1)/k-2]\sum_{j=0}^{L}\tilde{f}_j^{T}[x(k-1-m-j)/k-2]\Big\}\Big]
\end{aligned}
\tag{5.2.22}
\]


5.2.4  The  Nonlinear  Fixed-Point  Smoothing 


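When $\ell = k-N+1$ (where $k+1 > N$) is substituted into Equations
(5.2.8), (5.2.9), (5.2.12), (5.2.13) and (5.2.18) - (5.2.19), then the
recursive fixed-point smoother is given by the following

\[
\hat{x}(N/k+1) = \hat{x}(N/k) + K_{k+1}^{k-N+1}\{y(k+1) - \hat{h}[x(k+1),\,k+1/k]\}
\]

\[
\hat{x}(N/k) = \sum_{j=0}^{L} \hat{f}_j[x(N-1-j),\,N-1-j/k]
\]

\[
K_{k+1}^{k-N+1} = E_k\{\tilde{x}(N/k)\,\tilde{h}^{T}[x(k+1)/k]\}\,
 [\Psi_v(k+1) + E_k\{\tilde{h}[x(k+1)/k]\,\tilde{h}^{T}[x(k+1)/k]\}]^{-1}
\]

\[
V_{\tilde{x}}(N/k+1) = V_{\tilde{x}}(N/k)
 - K_{k+1}^{k-N+1}\,E_k\{\tilde{h}[x(k+1)/k]\,\tilde{x}^{T}(N/k)\}
\]

\[
\begin{aligned}
V_{\tilde{x}}(N/k) &= \sum_{i,j=0}^{L} E_{k-1}\{\tilde{f}_i[x(N-1-i)/k-1]\,
 \tilde{f}_j^{T}[x(N-1-j)/k-1]\} \\
 &\quad + G[x(N-1),\,N-1]\,\Psi_w(N-1)\,G^{T}[x(N-1),\,N-1] \\
 &\quad - \kappa_{k}^{k-N+1}\Big[\Psi_{vw}(N-1)\,G^{T}[x(N-1),\,N-1]\,\delta_{k,N-1} \\
 &\qquad + E_{k-1}\Big\{\tilde{h}[x(k)/k-1]\sum_{j=0}^{L}\tilde{f}_j^{T}[x(N-1-j)/k-1]\Big\}\Big]
\end{aligned}
\]

\[
\begin{aligned}
\kappa_{k}^{k-N+1} &= \Big[E_{k-1}\Big\{\sum_{j=0}^{L}\tilde{f}_j[x(N-1-j)/k-1]\,
 \tilde{h}^{T}[x(k)/k-1]\Big\}
 + G[x(N-1),\,N-1]\,\Psi_{wv}(k)\,\delta_{k,N-1}\Big] \\
 &\quad \cdot [\Psi_v(k) + E_{k-1}\{\tilde{h}[x(k)/k-1]\,\tilde{h}^{T}[x(k)/k-1]\}]^{-1}
\end{aligned}
\]

and $V_{\tilde{x}}(k-\ell,\,k-m/k-1)$ is the same as that of Equation
(5.2.22).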


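5.2.5 The Nonlinear Fixed-Interval Smoothing

Setting $k+1 = N$ and $\ell = N-k$, where $N \ge k$, Equations (5.2.8),
(5.2.9), (5.2.12) - (5.2.13) and (5.2.18) - (5.2.19) become the
recursive algorithms for fixed-interval smoothing. They are

\[
\hat{x}(k/N) = \hat{x}(k/N-1) + K_{N}^{N-k}\{y(N) - \hat{h}[x(N),\,N/N-1]\}
\]

\[
\hat{x}(k/N-1) = \sum_{j=0}^{L} \hat{f}_j[x(k-1-j),\,k-1-j/N-1]
\]

\[
K_{N}^{N-k} = E_{N-1}\{\tilde{x}(k/N-1)\,\tilde{h}^{T}[x(N)/N-1]\}\,
 [\Psi_v(N) + E_{N-1}\{\tilde{h}[x(N)/N-1]\,\tilde{h}^{T}[x(N)/N-1]\}]^{-1}
\]

\[
V_{\tilde{x}}(k/N) = V_{\tilde{x}}(k/N-1)
 - K_{N}^{N-k}\,E_{N-1}\{\tilde{h}[x(N)/N-1]\,\tilde{x}^{T}(k/N-1)\}
\]

\[
\begin{aligned}
V_{\tilde{x}}(k/N-1) &= \sum_{i,j=0}^{L} E_{N-2}\{\tilde{f}_i[x(k-1-i)/N-2]\,
 \tilde{f}_j^{T}[x(k-1-j)/N-2]\} \\
 &\quad + G[x(k-1),\,k-1]\,\Psi_w(k-1)\,G^{T}[x(k-1),\,k-1] \\
 &\quad - \kappa_{N-1}^{N-k}\Big[\Psi_{vw}(k-1)\,G^{T}[x(k-1),\,k-1]\,\delta_{N-1,k-1} \\
 &\qquad + E_{N-2}\Big\{\tilde{h}[x(N-1)/N-2]\sum_{j=0}^{L}\tilde{f}_j^{T}[x(k-1-j)/N-2]\Big\}\Big]
\end{aligned}
\]

and

\[
\begin{aligned}
\kappa_{N-1}^{N-k} &= \Big[E_{N-2}\Big\{\sum_{j=0}^{L}\tilde{f}_j[x(k-1-j)/N-2]\,
 \tilde{h}^{T}[x(N-1)/N-2]\Big\}
 + G[x(k-1),\,k-1]\,\Psi_{wv}(N-1)\,\delta_{N-1,k-1}\Big] \\
 &\quad \cdot [\Psi_v(N-1) + E_{N-2}\{\tilde{h}[x(N-1)/N-2]\,\tilde{h}^{T}[x(N-1)/N-2]\}]^{-1}
\end{aligned}
\]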

5.2.6  The  Nonlinear  Filtering 

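When $\ell$ is set to zero, Equations (5.2.8), (5.2.9), (5.2.12),
(5.2.13) and (5.2.18) - (5.2.19) become the filtering algorithms.
Namely:

\[
\hat{x}(k+1/k+1) = \hat{x}(k+1/k) + K_{k+1}^{0}\{y(k+1) - \hat{h}[x(k+1),\,k+1/k]\}
\]

\[
\hat{x}(k+1/k) = \sum_{j=0}^{L} \hat{f}_j[x(k-j),\,k-j/k]
\]

\[
K_{k+1}^{0} = E_k\{\tilde{x}(k+1/k)\,\tilde{h}^{T}[x(k+1)/k]\}\,
 [\Psi_v(k+1) + E_k\{\tilde{h}[x(k+1)/k]\,\tilde{h}^{T}[x(k+1)/k]\}]^{-1}
\]

\[
V_{\tilde{x}}(k+1/k+1) = V_{\tilde{x}}(k+1/k)
 - K_{k+1}^{0}\,E_k\{\tilde{h}[x(k+1)/k]\,\tilde{x}^{T}(k+1/k)\}
\]

\[
\begin{aligned}
V_{\tilde{x}}(k+1/k) &= \sum_{i,j=0}^{L} E_{k-1}\{\tilde{f}_i[x(k-i)/k-1]\,
 \tilde{f}_j^{T}[x(k-j)/k-1]\}
 + G[x(k),\,k]\,\Psi_w(k)\,G^{T}[x(k),\,k] \\
 &\quad - \kappa_{k}^{0}\Big[\Psi_{vw}(k)\,G^{T}[x(k),\,k]
 + E_{k-1}\Big\{\tilde{h}[x(k)/k-1]\sum_{j=0}^{L}\tilde{f}_j^{T}[x(k-j)/k-1]\Big\}\Big]
\end{aligned}
\]

and finally

\[
\begin{aligned}
\kappa_{k}^{0} &= \Big[E_{k-1}\Big\{\sum_{j=0}^{L}\tilde{f}_j[x(k-j)/k-1]\,
 \tilde{h}^{T}[x(k)/k-1]\Big\} + G[x(k),\,k]\,\Psi_{wv}(k)\Big] \\
 &\quad \cdot [\Psi_v(k) + E_{k-1}\{\tilde{h}[x(k)/k-1]\,\tilde{h}^{T}[x(k)/k-1]\}]^{-1}
\end{aligned}
\]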

5.2.7  Estimation  in  Linear  Discrete  Systems 

In this section, general linear estimation algorithms are obtained for
discrete delayed systems with non-delayed measurements corrupted by
correlated noise processes. The algorithms can be easily converted to
yield the fixed-lag, fixed-point, fixed-interval smoothing and the
filtering estimators together with their respective error-variance
equations. This is done by making the proper choice of discrete
indices, as shown in Sections 5.2.3 to 5.2.6.

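For linear discrete systems, we have

\[
\sum_{j=0}^{L} f_j[x(k-j),\,k-j] = \sum_{j=0}^{L} F_j(k)\,x(k-j)
\]

\[
h[x(k),\,k] = H(k)\,x(k)
\]

and

\[
G[x(k),\,k] = G(k)
\]

Then the general linear estimation algorithms are obtained from
Equations (5.2.8), (5.2.9), (5.2.12) - (5.2.13) and (5.2.18) -
(5.2.20), by making the above replacement

\[
\hat{x}(k-\ell+1/k+1) = \hat{x}(k-\ell+1/k)
 + K_{k+1}^{\ell}\{y(k+1) - H(k+1)\,\hat{x}(k+1/k)\}
\]

\[
\hat{x}(k-\ell+1/k) = \sum_{j=0}^{L} F_j(k-\ell)\,\hat{x}(k-\ell-j/k)
\]

\[
K_{k+1}^{\ell} = V_{\tilde{x}}(k-\ell+1,\,k+1/k)\,H^{T}(k+1)\,
 [\Psi_v(k+1) + H(k+1)\,V_{\tilde{x}}(k+1/k)\,H^{T}(k+1)]^{-1}
\]

\[
V_{\tilde{x}}(k-\ell+1/k+1) = V_{\tilde{x}}(k-\ell+1/k)
 - K_{k+1}^{\ell}\,H(k+1)\,V_{\tilde{x}}(k+1,\,k-\ell+1/k)
\]

\[
\begin{aligned}
V_{\tilde{x}}(k-\ell+1/k) &= \sum_{i,j=0}^{L} F_i(k-\ell)\,V_{\tilde{x}}(k-\ell-i,\,k-\ell-j/k-1)\,
 F_j^{T}(k-\ell)
 + G(k-\ell)\,\Psi_w(k-\ell)\,G^{T}(k-\ell) \\
 &\quad - \kappa_{k}^{\ell}\Big[\Psi_{vw}(k-\ell)\,G^{T}(k-\ell)\,\delta_{k,k-\ell}
 + H(k)\sum_{j=0}^{L} V_{\tilde{x}}(k,\,k-\ell-j/k-1)\,F_j^{T}(k-\ell)\Big]
\end{aligned}
\]

and

\[
\begin{aligned}
\kappa_{k}^{\ell} &= \Big[\sum_{j=0}^{L} F_j(k-\ell)\,V_{\tilde{x}}(k-\ell-j,\,k/k-1)\,H^{T}(k)
 + G(k-\ell)\,\Psi_{wv}(k)\,\delta_{k,k-\ell}\Big] \\
 &\quad \cdot [\Psi_v(k) + H(k)\,V_{\tilde{x}}(k/k-1)\,H^{T}(k)]^{-1}
\end{aligned}
\]

\[
\begin{aligned}
V_{\tilde{x}}(k-\ell+1,\,k-m+1/k) &= \sum_{i,j=0}^{L} F_i(k-\ell)\,
 V_{\tilde{x}}(k-\ell-i,\,k-m-j/k-1)\,F_j^{T}(k-m) \\
 &\quad + G(k-\ell)\,\Psi_w(k-\ell)\,\delta_{k-\ell,k-m}\,G^{T}(k-m) \\
 &\quad - \kappa_{k}^{\ell}\Big[\Psi_{vw}(k-m)\,\delta_{k,k-m}\,G^{T}(k-m)
 + \sum_{j=0}^{L} H(k)\,V_{\tilde{x}}(k,\,k-m-j/k-1)\,F_j^{T}(k-m)\Big]
\end{aligned}
\]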


It  can  be  easily  shown  that  in  the  special  case  of  discrete 
systems  without  delay,  the  results  obtained  for  the  filtering  algorithm 
agree  well  with  those  in  the  literature  [49],  but  the  smoothing  results 
are  new. 
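The delay-free special case mentioned above can be sketched for a scalar system. With $L = 0$ and $\ell = 0$, the linear equations of this section reduce to a one-step recursion for the prediction variance $V_{\tilde{x}}(k+1/k)$ in terms of $V_{\tilde{x}}(k/k-1)$. The code below is an illustrative sketch with hypothetical function names; the second function is the familiar two-stage form (measurement update followed by time update) for uncorrelated noise, included only as a cross-check.

```python
def predictor_step(V, F, H, G, psi_w, psi_v, psi_vw):
    """Scalar, delay-free form of the linear equations with l = 0.

    V is V(k/k-1); returns the gain kappa and V(k+1/k).
    In the scalar case psi_wv equals psi_vw.
    """
    kappa = (F * V * H + G * psi_vw) / (psi_v + H * V * H)
    V_next = F * V * F + G * psi_w * G - kappa * (psi_vw * G + H * V * F)
    return kappa, V_next


def two_stage_step(V, F, H, G, psi_w, psi_v):
    """Uncorrelated-noise cross-check: filter update, then prediction."""
    K = V * H / (psi_v + H * V * H)
    V_filtered = V - K * H * V
    return F * V_filtered * F + G * psi_w * G
```

With `psi_vw = 0` the two functions return the same prediction variance, which is the agreement with the uncorrelated results referred to in this section. A nonzero `psi_vw` changes both the gain and the variance through the extra terms in the two brackets.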

In the particular case of linear discrete delayed systems with
uncorrelated noise processes, the filtering and fixed-lag smoothing
algorithms obtained here can be shown to agree with those of Priemer
and Vacroux [37,38].

5.3 NONLINEAR DISCRETE DELAYED SYSTEMS WITH MEASUREMENTS CONTAINING
MULTIPLE DELAYS IMBEDDED IN CORRELATED NOISE PROCESSES

In this section, estimation algorithms are derived for nonlinear
discrete delayed systems with measurements containing multiple delays
imbedded in correlated noises.

In Section 5.3.1, the problem statement is presented.

Section 5.3.2 derives the discrete nonlinear smoothing algorithms,
which can be directly applied to yield the fixed-lag, fixed-point,
fixed-interval smoothing and the filtering algorithms, with the proper
substitution of discrete indices.

Section 5.3.3 presents an example to indicate how one can apply the
proposed algorithms to physical systems evaluation.

Section 5.3.4 presents the general estimation algorithms for the
linear counterpart of the above problems; the results are identified
with those of the literature.

5.3.1  The  Problem  Statement 

Consider the discrete nonlinear multiple time delay message model of
Equation (5.2.1) with the measurement model described by

\[
y(k) = \sum_{j=0}^{N} h_j[x(k-j),\,k-j] + v(k)
\tag{5.3.1}
\]

where $h_j$ is an $m$-dimensional nonlinear vector valued function and
$v$ is a zero-mean white noise sequence correlated with $w$, with prior
statistics and all initial conditions identical to those of Section
5.2.1.

The state estimation problem also follows that of Section 5.2.1.

5.3.2  The  Derivation  of  the  Smoother 

With reasoning similar to that of Section 4.2.2, it is assumed that
the smoothed estimate satisfies the dynamic equation*

\[
\hat{x}(k-\ell+1/k+1) = \sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),\,k-\ell-j]
 + K_{k+1}^{\ell}\,y(k+1)
\tag{5.3.2}
\]

* When another constraining equation such as

\[
\hat{x}(k-\ell+1/k+1) = \sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),\,k-\ell-j]
 + K_{k+1}^{\ell}\{y(k+1) - E_k[y(k+1)]\}
\]

is used, the resulting smoothed estimate remains the same.

Since $\hat{x}(k-\ell+1/k+1)$ is required to be an unbiased estimate,
it can be shown that

\[
\sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),\,k-\ell-j]
 = \sum_{j=0}^{L} \hat{f}_j[x(k-\ell-j),\,k-\ell-j/k]
 - K_{k+1}^{\ell}\sum_{j=0}^{N} \hat{h}_j[x(k-j+1),\,k-j+1/k]
\]

where

\[
\hat{f}_j[x(k-\ell-j),\,k-\ell-j/k] = E_k\{f_j[x(k-\ell-j),\,k-\ell-j]\}
\]

and

\[
\hat{h}_j[x(k-j+1),\,k-j+1/k] = E_k\{h_j[x(k-j+1),\,k-j+1]\}
\]

Also,

\[
\sum_{j=0}^{L} \hat{f}_j[x(k-\ell-j),\,k-\ell-j/k] = \hat{x}(k-\ell+1/k)
\tag{5.3.3}
\]

Then

\[
\hat{x}(k-\ell+1/k+1) = \hat{x}(k-\ell+1/k)
 + K_{k+1}^{\ell}\Big\{y(k+1) - \sum_{j=0}^{N} \hat{h}_j[x(k-j+1),\,k-j+1/k]\Big\}
\tag{5.3.4}
\]

whereas the smoothing error satisfies the relation

\[
\tilde{x}(k-\ell+1/k+1) = \tilde{x}(k-\ell+1/k)
 - K_{k+1}^{\ell}\Big\{\sum_{j=0}^{N} \tilde{h}_j[x(k-j+1)/k] + v(k+1)\Big\}
\tag{5.3.5}
\]

where it is defined that

\[
\tilde{h}_j[x(k-j+1)/k] = h_j[x(k-j+1),\,k-j+1] - \hat{h}_j[x(k-j+1),\,k-j+1/k]
\]

and the matrix $V_{\tilde{x}}(k-\ell+1/k+1)$ can be determined from
Equations (5.2.4) and (5.3.5).


In order that the cost function of Equation (5.2.3) is minimized, the
optimal value of the matrix $K_{k+1}^{\ell}$ is obtained by setting the
gradient of $J(k+1)$ equal to the null matrix. Hence,

\[
\begin{aligned}
K_{k+1}^{\ell} &= E_k\Big\{\tilde{x}(k-\ell+1/k)\sum_{j=0}^{N}\tilde{h}_j^{T}[x(k-j+1)/k]\Big\} \\
 &\quad \cdot \Big[\Psi_v(k+1) + E_k\Big\{\sum_{i,j=0}^{N}\tilde{h}_i[x(k-i+1)/k]\,
 \tilde{h}_j^{T}[x(k-j+1)/k]\Big\}\Big]^{-1}
\end{aligned}
\tag{5.3.6}
\]

and

\[
V_{\tilde{x}}(k-\ell+1/k+1) = V_{\tilde{x}}(k-\ell+1/k)
 - K_{k+1}^{\ell}\,E_k\Big\{\sum_{j=0}^{N}\tilde{h}_j[x(k-j+1)/k]\,\tilde{x}^{T}(k-\ell+1/k)\Big\}
\tag{5.3.7}
\]

Also, if the matrix $V_{\tilde{x}}(k-n+1,\,k-m+1/k+1)$ is defined as

\[
V_{\tilde{x}}(k-n+1,\,k-m+1/k+1) = E_{k+1}\{\tilde{x}(k-n+1/k+1)\,\tilde{x}^{T}(k-m+1/k+1)\}
\]

then it can be shown that

\[
V_{\tilde{x}}(k-n+1,\,k-m+1/k+1) = V_{\tilde{x}}(k-n+1,\,k-m+1/k)
 - K_{k+1}^{n}\,E_k\Big\{\sum_{j=0}^{N}\tilde{h}_j[x(k-j+1)/k]\,\tilde{x}^{T}(k-m+1/k)\Big\}
\tag{5.3.8}
\]

for any integers $n, m$ with $0 \le n, m \le k+1$.


As is presented in Section 5.2.2, the unbiased estimate
$\hat{x}(k-\ell+1/k)$ of the state vector $x(k-\ell+1)$ can also be
obtained as

\[
\hat{x}(k-\ell+1/k) = \sum_{j=0}^{L} \hat{f}_j[x(k-\ell-j),\,k-\ell-j/k-1]
 + \kappa_{k}^{\ell}\Big\{y(k) - \sum_{j=0}^{N} \hat{h}_j[x(k-j),\,k-j/k-1]\Big\}
\tag{5.3.9}
\]

and the optimal value of $\kappa_{k}^{\ell}$ is given by

\[
\begin{aligned}
\kappa_{k}^{\ell} &= \Big[E_{k-1}\Big\{\sum_{i=0}^{L}\tilde{f}_i[x(k-\ell-i)/k-1]
 \sum_{j=0}^{N}\tilde{h}_j^{T}[x(k-j)/k-1]\Big\}
 + G[x(k-\ell),\,k-\ell]\,\Psi_{wv}(k)\,\delta_{k,k-\ell}\Big] \\
 &\quad \cdot \Big[\Psi_v(k) + E_{k-1}\Big\{\sum_{i,j=0}^{N}\tilde{h}_i[x(k-i)/k-1]\,
 \tilde{h}_j^{T}[x(k-j)/k-1]\Big\}\Big]^{-1}
\end{aligned}
\tag{5.3.10}
\]

where

\[
\tilde{f}_i[x(k-\ell-i)/k-1] = f_i[x(k-\ell-i),\,k-\ell-i]
 - \hat{f}_i[x(k-\ell-i),\,k-\ell-i/k-1]
\]

and

\[
\begin{aligned}
V_{\tilde{x}}(k-n+1,\,k-m+1/k) &= \sum_{i,j=0}^{L} E_{k-1}\{\tilde{f}_i[x(k-n-i)/k-1]\,
 \tilde{f}_j^{T}[x(k-m-j)/k-1]\} \\
 &\quad + G[x(k-n),\,k-n]\,\Psi_w(k-n)\,\delta_{k-n,k-m}\,G^{T}[x(k-m),\,k-m] \\
 &\quad - \kappa_{k}^{n}\Big[\Psi_{vw}(k-m)\,\delta_{k-m,k}\,G^{T}[x(k-m),\,k-m] \\
 &\qquad + E_{k-1}\Big\{\sum_{i=0}^{N}\tilde{h}_i[x(k-i)/k-1]
 \sum_{j=0}^{L}\tilde{f}_j^{T}[x(k-m-j)/k-1]\Big\}\Big]
\end{aligned}
\tag{5.3.11}
\]

whereas $V_{\tilde{x}}(k-\ell+1/k)$ can be obtained from the above
equation by setting both $n$ and $m$ equal to $\ell$.

It should be noted that the three different types of smoothing and the
filtering algorithms all follow immediately from Equations (5.3.3),
(5.3.4), (5.3.6) - (5.3.8) and (5.3.10) - (5.3.11) with the following
substitution:

                                The replacement    The replacement
                                for k+1            for $\ell$

The filtering estimation        k+1                0
The fixed-lag smoothing         k                  N
The fixed-point smoothing       k+1                k-N+1
The fixed-interval smoothing    N                  N-k
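The substitution rules above can be encoded directly, which also makes it easy to confirm which estimate $\hat{x}(k-\ell+1/k+1)$ each choice produces. The helper names below are hypothetical and the sketch covers only the index bookkeeping, not the estimation equations themselves.

```python
def substitution(mode, k, N=None):
    """Return (replacement for k+1, replacement for l) for the given estimator."""
    if mode == "filtering":
        return k + 1, 0
    if mode == "fixed-lag":
        return k, N
    if mode == "fixed-point":
        return k + 1, k - N + 1
    if mode == "fixed-interval":
        return N, N - k
    raise ValueError("unknown mode: " + repr(mode))


def estimate_indices(mode, k, N=None):
    """Times (j, n) such that the general estimate becomes x^(j/n)."""
    data_time, lag = substitution(mode, k, N)
    # In x^(k-l+1/k+1) the state time is (k+1) - l and the data time is k+1.
    return data_time - lag, data_time
```

For example, `estimate_indices("fixed-lag", 10, 3)` gives the pair for $\hat{x}(k-N/k)$ with $k = 10$ and $N = 3$, and `estimate_indices("fixed-point", 10, 3)` gives the pair for $\hat{x}(N/k+1)$.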


Also notice that in the case of nonlinear systems, the preceding
algorithms require infinite dimensional systems for their realization,
which is practically impossible except in trivially simple cases.

In order that the smoother can be implemented in computer
evaluation, it is assumed that the conditional probability density
functions of the smoothing errors are Gaussian. This is very
significant for systems with product-type or polynomial nonlinearities,
since in these cases the smoothing algorithms can be evaluated or
physically realized without any further approximation.
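As a minimal sketch of why the Gaussian assumption closes the expectations for polynomial nonlinearities, consider a scalar state whose estimate is `mean` and whose error variance is `var`. The function names below are illustrative and do not appear in the thesis; the closed-form moments are the standard Gaussian ones, and the quadrature helper is included only as an independent check.

```python
import numpy as np


def gaussian_poly_moments(mean, var):
    """Closed-form E[x^2], E[x^3], E[x^4] for a scalar x ~ N(mean, var)."""
    m2 = mean**2 + var
    m3 = mean**3 + 3.0 * mean * var
    m4 = mean**4 + 6.0 * mean**2 * var + 3.0 * var**2
    return m2, m3, m4


def gaussian_expectation(g, mean, var, order=12):
    """E[g(x)] for x ~ N(mean, var) by Gauss-Hermite quadrature.

    The probabilists' Hermite rule uses the weight exp(-t^2 / 2), so the
    weighted sum is normalised by sqrt(2 * pi).
    """
    nodes, weights = np.polynomial.hermite_e.hermegauss(order)
    values = g(mean + np.sqrt(var) * nodes)
    return float(np.sum(weights * values) / np.sqrt(2.0 * np.pi))


# Hypothetical numbers: estimate 1.5 with error variance 0.4.
m2, m3, m4 = gaussian_poly_moments(1.5, 0.4)
m4_check = gaussian_expectation(lambda x: x**4, 1.5, 0.4)
```

Under the assumption, every expectation of a polynomial in the state reduces to an expression in the estimate and the error variance alone, which is what makes the recursion finite dimensional.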

Moreover, the smoothing estimators are already required to
be unbiased; therefore, the expectation terms in Equations
(5.3.6) - (5.3.8) and (5.3.10) - (5.3.11) can be further simplified
by Equations (4.2.20) - (4.2.22) and (5.2.21), and also by

$$E_{k-1}\{\tilde{h}_i[x(k-i)/k-1]\,\tilde{h}_j^T[x(k-j)/k-1]\}
 = E_{k-1}\{h_i[x(k-i),k-i]\,h_j^T[x(k-j),k-j]\}
 - \hat{h}_i[x(k-i),k-i/k-1]\,\hat{h}_j^T[x(k-j),k-j/k-1]$$

Therefore, the estimation algorithms presented in this
chapter are not only new and recursive in nature, but also
computationally effective, and they show the close link between
the various types of state estimation problems.

5.3.3  Illustrative Example

To illustrate the use of the preceding algorithms, consider
the nonlinear system equations presented in Section 4.3.3. Here,
however, the noise processes v(k) and w(k) are correlated with the
following statistic:

$$E\{v(k)\,w^T(j)\} = \Psi_{vw}(k)\,\delta_{k,j}$$

The initial conditions and prior statistics are the same
as in Section 4.3.3.

Then, following the evaluation presented in Section 4.3.3,
and applying Equations (5.3.3), (5.3.4), (5.3.6), (5.3.7) and (5.3.10),
with $\ell$ set to zero, the filtering algorithms are obtained as follows:


x(k+l/k)  =  3 V  (k)  x(k)  +  x  (k/k)  +  2x(k-l/k) 


x 


x(k+l/k+l )  =  x(k+l/k)  +  K°+1{y(k+l) 

-  V  (k+l/k)  -  x  (k+l/k)  -  x(k) } 
x 


K°  ,  =  [2Vy  k+l/k)  £(k+l/k)  +  V^(k+1 ,k/k)] 

X  X 

O  (k+1)  +  2V^(k+l/k)  +  4V^( k+l/k) 
x  x 

2 

x  (k+l/k)  +4Vjk+l,k/k)  x(k+l/k)  +  Vjk)]"1 
x  x 

V  (k+l/k+1)  =  vy  k+l/k)  -  K°  [2Vjk+l/k) 
XX  x 

x(k+l/k)  +  Vjk+l,k/k)] 
x 


where


V  (k+l/k)  =  15V^(k/k-l)  +  36V^(k/k-l)  x  (k/k-1) 

x  x 


+  9VJ  k/k-1)  x  (k)  +  12V  (k.k-l/k-l)  V  (k/k-1) 
x  xx 


+  12V  (k,k-l/k-l )  x  (k/k-1) 
x 


+  4V^(k-l/k-l ) 
x 


+  fw(k)  -  <?{»  (k)  +  2V^( k/k-1 ) 

x 


+  4V  (k/k-1)  x  (k/k-1)  +  4V  (k,k-l/k-l)  x(k/k-l) 

"X/  X/ 

X  X 


+  v^(k-l)}  k°T 

X 


0 

Kk 


{^wv(k)  G'[x(k),k] 


+  12V  (k/k-1 )  x (k/k-1 )  +  3V  (k,k-l/k-l)  V  (k/k-1) 

X  XX 


+  10V  (k/k-1)  X  (k/k-1)  +  3 V  (k/k-1)  x(k/k-l)  x(k-l/k-l) 

X  X 

X  X 

.2  .5  .3 

+  3Vjk, k-l/k-1)  x  (k/k-1)  +  x  (k/k-1)  +  x  (k/k-1) 
x 


x(k-l/k-l)  +  4V  (k-1, k/k-1)  x ( k/k-1 )  +  2V  (k-l/k-1) 

X  X 

X  X 


+  2x (k-l/k-1)  V  (k/k-1)  +  2x(k-l/k-l)  x  (k/k-1) 

X 
+  2x  (k-l/k-1)}.  (k)  +  2V  (k/k-1 )

v  ^ 

x 

.2 

+  4V  (k/k-1)  x  (k/k-1)  +  4V  (k, k-l/k-1) 

'\j  f\j 

x  x 


•  x  ( k/k- 1 )  +  Vjk-1)} 
x 


In the same manner, the fixed-lag, fixed-point and fixed-interval
smoothing algorithms can be obtained from Equations (5.3.3),
(5.3.4), (5.3.6), (5.3.7), (5.3.10) and (5.3.11) by proper choice of
the discrete time indices k and $\ell$, as listed in Section 5.3.2.


Notice that in the above evaluation, the only assumption
needed is the Gaussian assumption on the estimator error.


5.3.4  Estimation  in  Linear  Discrete  Systems 


In this section, the message and measurement models of the
estimation problems are linear and discrete. They can be considered
as a special case of the nonlinear estimation problems, satisfying
the following relations:

$$\sum_{j=0}^{L} f_j[x(k-j),k-j] = \sum_{j=0}^{L} F_j(k)\,x(k-j)$$

$$\sum_{j=0}^{N} h_j[x(k-j),k-j] = \sum_{j=0}^{N} H_j(k)\,x(k-j)$$

and

$$G[x(k),k] = G(k)$$
Then the linear estimation algorithms can be obtained directly
from the results presented in Section 5.3.2. They are:

$$\hat{x}(k-\ell+1/k+1) = \hat{x}(k-\ell+1/k) + K_{k+1}^{\ell}\Big\{y(k+1) - \sum_{j=0}^{N} H_j(k+1)\,\hat{x}(k-j+1/k)\Big\}$$

$$\hat{x}(k-\ell+1/k) = \sum_{j=0}^{L} F_j(k-\ell)\,\hat{x}(k-\ell-j/k)$$

$$K_{k+1}^{\ell} = \sum_{j=0}^{N} V_{\tilde{x}}(k-\ell+1,k-j+1/k)\,H_j^T(k+1)
 \Big[\Psi_v(k+1) + \sum_{i,j=0}^{N} H_i(k+1)\,V_{\tilde{x}}(k+1-i,k+1-j/k)\,H_j^T(k+1)\Big]^{-1}$$

$$V_{\tilde{x}}(k-\ell+1/k+1) = V_{\tilde{x}}(k-\ell+1/k) - K_{k+1}^{\ell}\Big\{\sum_{j=0}^{N} H_j(k+1)\,V_{\tilde{x}}(k-j+1,k-\ell+1/k)\Big\}$$

L 

V  (k-£+l/k)  =  7  F . (k-£)  V  (k-£-i ,k-£-j/k-l ) 

x  i.j=o  1  x 


•  fT (k-A)  +  G(k-i)  t  (k-£)  GT(k-£) 

J 


-  K*[Twv(k-i)  GT(k-£)  + 


N  L  T 

l  l  H- (k)  V^( k-i , k-£- j/k-1 )  F j ( k— £ ) ] 
i=o  j=o  x  J 


and

,  L  N 

<k  =  Cl  l  Fj(k-i)  V  (k-d-i.k-j/k-l) 

i=o  j=o  x 

Hj (k)  +  G(k-s> )  Ywv ( k-A ) ] 

N  T  1 

'  [fv(k)  +  I  H.(k)  V  (k-1  ,k-j/k-l )  H  (k)r' 
i,j=o  x  J 

The three different types of smoothing and the filtering
algorithms all follow immediately from the above equations with
the proper choice of k and $\ell$ listed in Section 5.3.2.

For linear state and measurement models without delays,
the results agree well with those in the literature [49]. If
$\Psi_{wv}$ and N are set to zero, the results of this section become the
stable estimation algorithms presented by Priemer and
Vacroux [37, 38]. For linear systems with uncorrelated noise processes,
the results also agree well with those of Shukla and Srinath [56].
5.4  CONCLUSION

In this chapter a new unified approach has been presented to
obtain the fixed-lag, fixed-point and fixed-interval smoothing
algorithms and the filtering algorithms for discrete nonlinear delayed
systems with non-delayed measurements, as well as multichannel delayed
measurements, corrupted by correlated noise processes. The derivation
is straightforward and shows the close link between the smoothing and
filtering estimation problems.


The results are new, recursive in nature, and physically
realizable under the assumption that the probability density functions
of the estimator errors are Gaussian. They can be obtained without
any further approximations for systems with product-type or polynomial
nonlinearities. They are expected to be computationally faster than
applying an extended Kalman filter to an augmented system with no
explicit delay terms. Since the augmentation of the state leads to
higher dimensional systems, this is particularly significant for systems
with numerous delay parameters.

For linear estimation problems, the results agree well with
those of the literature and are computationally stable. For the
nonlinear estimation algorithms, however, the stability behavior of
the systems needs to be further investigated.

The results presented can be extended to continuous time
problems through a formal limiting procedure.


CHAPTER  VI


MINIMUM  VARIANCE  FILTERING  AND  SMOOTHING 

FOR  NONLINEAR  SYSTEMS  WITH  COLORED  NOISE 

6.1  INTRODUCTION

This chapter extends the technique developed in the preceding
chapters to another physically realistic estimation problem: the
state estimation of nonlinear discrete delayed systems with
non-delayed measurements, as well as measurements containing multiple
delays, imbedded in a colored noise sequence.

In the past, for linear estimation problems without time
delays, two methods have, in general, been employed to account for
time-correlated measurement noise in a linear filter. The first
method, proposed by Kalman [3], employs state augmentation.
It involves the construction of a hypothetical shaping filter whose
input is white noise and whose output is noise with the required
correlations. Here, the dynamics of the shaping filter are included
as a part of the system dynamics, and its states are estimated
together with the system states. The second method, developed by
Bryson and Henrikson [7], employs an output differencing technique.
It devises a modified measurement by subtracting the most recent
past value of the measurement, weighted by the noise correlation
coefficient matrix, from the present measurement. As a result, the
modified measurement contains only white noise processes.

Recently, Priemer and Vacroux [37, 38] reported filtering
and fixed-lag smoothing algorithms for linear discrete time-delayed
systems corrupted by white noise sequences, using the orthogonal
projection method. Biswas and Mahalanabis employed the state
augmentation technique for linear discrete time-delayed estimation
problems involving colored noise processes [55, 61].

To the author's knowledge, nonlinear estimation problems
for systems with or without delays, imbedded in colored noise processes,
are still outstanding. As an innovation, this thesis presents
a direct derivation of estimation algorithms for discrete nonlinear
time-delayed systems with non-delayed measurements, as well as
multichannel measurements involving time delays, imbedded in
colored noise processes.

The derivation does not require the augmentation of the
state vector, and hence does not require an increase in the order of
the system; it leads to a filter that is computationally efficient.
The proposed technique makes use of the matrix minimum principle to
derive the optimal values of the coefficients in the estimation
algorithms under the requirements that the estimates be unbiased and
the error-variance be minimized. The results are reported in [62, 63].

6.2  NONLINEAR DISCRETE DELAYED SYSTEMS WITH NON-DELAYED MEASUREMENTS
IMBEDDED IN COLORED NOISE

In this section, estimation algorithms for all three
cases of smoothing and for filtering are derived for nonlinear
discrete time-delayed systems with non-delayed measurements
imbedded in colored noise.

Section 6.2.1 presents the statement of the problem.
Section 6.2.2 describes the derivation of the nonlinear smoother.
Sections 6.2.3 and 6.2.4, respectively, present the fixed-lag smoothing
and the fixed-point smoothing algorithms. Fixed-interval smoothing
and filtering algorithms are not presented; they can be obtained from
the results of Section 6.2.2 with the proper choice of discrete
time indices.

For linear estimation problems, the three cases of smoothing
and the filtering algorithms follow immediately from the results
presented in Section 6.2.5. They are identified with various results
in the literature.

6.2.1  The  Problem  Statement 

The process considered in this section is described by

$$x(k+1) = \sum_{j=0}^{L} f_j[x(k-j),k-j] + G[x(k),k]\,w(k) \tag{6.2.1}$$

where the measurement is given by

$$y(k) = H(k)\,x(k) + \gamma(k) \tag{6.2.2}$$

and the measurement noise $\gamma(k)$ is a colored sequence given by

$$\gamma(k+1) = A(k+1,k)\,\gamma(k) + v(k) \tag{6.2.3}$$
Here, k = 0, 1, 2, ... is the discrete time index; the state x is an
n-vector; the measurement y an m-vector; the state noise sequence w
an r-vector; the measurement noise $\gamma$ an m-vector; G a nonlinear
state-dependent n×r matrix; $f_j$ a nonlinear n-dimensional
vector-valued function; H and A are, respectively, m×n and m×m matrices.

The noise sequences w and v are correlated zero-mean white
Gaussian with known statistics

$$E\{w(k)\,v^T(j)\} = \Psi_{wv}(k)\,\delta_{k,j}$$

$$E\{w(k)\,w^T(j)\} = \Psi_{w}(k)\,\delta_{k,j}$$

and

$$E\{v(k)\,v^T(j)\} = \Psi_{v}(k)\,\delta_{k,j}$$

for all integers k and j, where E denotes the expected value, $\delta_{k,j}$
the Kronecker delta, the matrices $\Psi_w$ and $\Psi_{wv}$ are non-negative definite,
whereas $\Psi_v$ is positive definite.

The initial states are zero-mean Gaussian random vectors,
which are independent of v(k) and w(k). For all positive integers
m and $\ell$ of interest, it is required that

$$E\{x(-m)\,x^T(-\ell)\} = V_x(m,\ell) \tag{6.2.4}$$
The smoothing problem is to obtain $\hat{x}(k-\ell+1/k+1)$, the unbiased
smoothing estimate of $x(k-\ell+1)$ for non-negative integer values of $\ell$,
conditioned on the set of measurements

$$Y(k+1) = \{y(0),\, y(1),\, \ldots,\, y(k+1)\}$$

such that the cost function

$$J(k+1) = \mathrm{Trace}\,[V_{\tilde{x}}(k-\ell+1/k+1)] \tag{6.2.5}$$

is minimized.

6.2.2  The  Derivation  of  the  Smoother 

Examination of Equations (6.2.2) and (6.2.3) indicates that
a modified observation z(k+1) can be defined such that its additive
noise is a white Gaussian sequence. That is,

$$z(k+1) = y(k+1) - A(k+1,k)\,y(k)$$

which can be further simplified as

$$z(k+1) = \sum_{j=0}^{L} d_j[x(k-j),k-j] + H(k+1)\,G[x(k),k]\,w(k) + v(k) \tag{6.2.6}$$

where

$$\sum_{j=0}^{L} d_j[x(k-j),k-j] = H(k+1)\sum_{j=0}^{L} f_j[x(k-j),k-j] - A(k+1,k)\,H(k)\,x(k) \tag{6.2.7}$$

Note that the modified observation z(k+1) contains as much information
as y(k+1).
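The differencing step can be checked numerically. The following is a minimal scalar sketch, assuming constant H and A and an arbitrary state trajectory; all numerical values and the function name `difference_measurements` are illustrative and not from the thesis. It confirms that the noise left in z(k+1), once the state-dependent part is removed, is exactly the white sequence v(k).

```python
import numpy as np


def difference_measurements(y, A):
    """Return z(k+1) = y(k+1) - A * y(k) for a scalar measurement series."""
    y = np.asarray(y, dtype=float)
    return y[1:] - A * y[:-1]


rng = np.random.default_rng(0)
n_steps, H, A = 200, 2.0, 0.9          # hypothetical constants

# Any state trajectory will do for this check (here a scaled random walk).
x = 0.1 * np.cumsum(rng.normal(size=n_steps + 1))

# Colored measurement noise: gamma(k+1) = A * gamma(k) + v(k).
v = rng.normal(size=n_steps)
gamma = np.zeros(n_steps + 1)
for k in range(n_steps):
    gamma[k + 1] = A * gamma[k] + v[k]

y = H * x + gamma                       # measurement model (6.2.2)
z = difference_measurements(y, A)       # modified observation

# Remove the state-dependent part H*x(k+1) - A*H*x(k); what remains is v(k).
noise_in_z = z - (H * x[1:] - A * H * x[:-1])
```

The colored term cancels identically, which is why no shaping-filter state needs to be appended to the state vector.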

It can be assumed that the smoothed estimate is constrained
by the discrete nonlinear dynamic equation

$$\hat{x}(k-\ell+1/k+1) = \sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),k-\ell-j] + K_{k+1}^{\ell}\,z(k+1) \tag{6.2.8}$$

Let $\tilde{x}(k-\ell+1/k+1)$ denote the smoothing error defined by

$$\tilde{x}(k-\ell+1/k+1) = x(k-\ell+1) - \hat{x}(k-\ell+1/k+1) \tag{6.2.9}$$

In order that the smoothed estimate be unbiased, it can be shown that

$$\sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),k-\ell-j] = \sum_{j=0}^{L} \hat{f}_j[x(k-\ell-j),k-\ell-j/k]
 - K_{k+1}^{\ell}\sum_{j=0}^{L} \hat{d}_j[x(k-j),k-j/k] \tag{6.2.10}$$

where the following identities are introduced

$$\hat{f}_j[x(k-\ell-j),k-\ell-j/k] = E_k\{f_j[x(k-\ell-j),k-\ell-j]\}$$

and

$$\hat{d}_j[x(k-j),k-j/k] = E_k\{d_j[x(k-j),k-j]\}$$

with $E_k$ denoting the expectation operation conditioned on the set of
measurements Z(k).

Using Equations (6.2.1), (6.2.8) and (6.2.10), the smoothing
error equation of Equation (6.2.9) can be simplified as

$$\tilde{x}(k-\ell+1/k+1) = \sum_{j=0}^{L} \tilde{f}_j[x(k-\ell-j)/k] + G[x(k-\ell),k-\ell]\,w(k-\ell)
 - K_{k+1}^{\ell}\Big\{H(k+1)\,G[x(k),k]\,w(k) + v(k) + \sum_{j=0}^{L} \tilde{d}_j[x(k-j)/k]\Big\} \tag{6.2.11}$$

where

$$\tilde{f}_j[x(k-\ell-j)/k] = f_j[x(k-\ell-j),k-\ell-j] - \hat{f}_j[x(k-\ell-j),k-\ell-j/k]$$

and

$$\tilde{d}_j[x(k-j)/k] = d_j[x(k-j),k-j] - \hat{d}_j[x(k-j),k-j/k]$$
The error-variance matrix $V_{\tilde{x}}(k-\ell+1/k+1)$ can be obtained by
taking the expectation of Equation (6.2.11) multiplied by its own
transpose.

Then the optimal value of the matrix $K_{k+1}^{\ell}$ can be obtained by
setting the gradient of Equation (6.2.5) equal to the null matrix;
therefore,

$$K_{k+1}^{\ell} = \Big[E_k\Big\{\sum_{i,j=0}^{L} \tilde{f}_i[x(k-\ell-i)/k]\,\tilde{d}_j^T[x(k-j)/k]\Big\}
 + G[x(k-\ell),k-\ell]\,\Psi_w(k-\ell)\,\delta_{k-\ell,k}\,G^T[x(k),k]\,H^T(k+1)
 + G[x(k-\ell),k-\ell]\,\Psi_{wv}(k-\ell)\,\delta_{k-\ell,k}\Big]$$
$$\quad\cdot\Big[\Psi_v(k) + H(k+1)\,G[x(k),k]\,\Psi_w(k)\,G^T[x(k),k]\,H^T(k+1)
 + E_k\Big\{\sum_{i,j=0}^{L} \tilde{d}_i[x(k-i)/k]\,\tilde{d}_j^T[x(k-j)/k]\Big\}
 + H(k+1)\,G[x(k),k]\,\Psi_{wv}(k) + \Psi_{vw}(k)\,G^T[x(k),k]\,H^T(k+1)\Big]^{-1} \tag{6.2.12}$$
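The structure of Equation (6.2.12) can be seen from a short worked step. The shorthand $a$ and $b$ below is introduced only for this sketch and is not notation from the thesis: $a$ collects the terms of Equation (6.2.11) that do not multiply the gain, and $b$ collects the terms inside the braces.

```latex
\begin{align*}
\tilde{x} &= a - K b, \qquad
  a = \sum_{j=0}^{L}\tilde{f}_j[x(k-\ell-j)/k] + G[x(k-\ell),k-\ell]\,w(k-\ell),\\
  b &= H(k+1)\,G[x(k),k]\,w(k) + v(k) + \sum_{j=0}^{L}\tilde{d}_j[x(k-j)/k],\\[4pt]
J &= \operatorname{Trace} E\{(a-Kb)(a-Kb)^T\}\\
  &= \operatorname{Trace}\big[E\{aa^T\} - K\,E\{ba^T\} - E\{ab^T\}K^T + K\,E\{bb^T\}K^T\big],\\[4pt]
\frac{\partial J}{\partial K} &= -2\,E\{ab^T\} + 2\,K\,E\{bb^T\} = 0
  \;\;\Longrightarrow\;\; K = E\{ab^T\}\,\big[E\{bb^T\}\big]^{-1}.
\end{align*}
```

Read this way, the first bracket of Equation (6.2.12) is $E\{ab^T\}$ and the second bracket is $E\{bb^T\}$, each expanded using the noise statistics of Section 6.2.1.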


The smoothing estimate and its error-covariance equation are given
by

$$\hat{x}(k-\ell+1/k+1) = \sum_{j=0}^{L} \hat{f}_j[x(k-\ell-j),k-\ell-j/k]
 + K_{k+1}^{\ell}\Big\{y(k+1) - A(k+1,k)\,y(k) - H(k+1)\sum_{j=0}^{L} \hat{f}_j[x(k-j),k-j/k]
 + A(k+1,k)\,H(k)\,\hat{x}(k)\Big\} \tag{6.2.13}$$

and

$$V_{\tilde{x}}(k-m+1,k-\ell+1/k+1) = \Big[E_k\Big\{\sum_{i,j=0}^{L} \tilde{f}_i[x(k-m-i)/k]\,\tilde{f}_j^T[x(k-\ell-j)/k]\Big\}
 + G[x(k-m),k-m]\,\Psi_w(k-m)\,\delta_{k-m,k-\ell}\,G^T[x(k-\ell),k-\ell]\Big]$$
$$\quad - K_{k+1}^{m}\Big[E_k\Big\{\sum_{i,j=0}^{L} \tilde{d}_i[x(k-i)/k]\,\tilde{f}_j^T[x(k-\ell-j)/k]\Big\}
 + H(k+1)\,G[x(k),k]\,\Psi_w(k)\,\delta_{k,k-\ell}\,G^T[x(k-\ell),k-\ell]
 + \Psi_{vw}(k-\ell)\,\delta_{k,k-\ell}\,G^T[x(k-\ell),k-\ell]\Big] \tag{6.2.14}$$

respectively. $V_{\tilde{x}}(k-\ell+1/k+1)$ can be obtained from Equation (6.2.14)
by setting m equal to $\ell$.

Also, it can be shown that

$$\sum_{j=0}^{L} \hat{f}_j[x(k-\ell-j),k-\ell-j/k] = \hat{x}(k-\ell+1/k) \tag{6.2.15}$$

then Equations (6.2.13) and (6.2.14) can be respectively rewritten as:

$$\hat{x}(k-\ell+1/k+1) = \hat{x}(k-\ell+1/k) + K_{k+1}^{\ell}\Big\{y(k+1) - H(k+1)\,\hat{x}(k+1/k)
 - A(k+1,k)\,[y(k) - H(k)\,\hat{x}(k)]\Big\} \tag{6.2.16}$$

and

$$V_{\tilde{x}}(k-m+1,k-\ell+1/k+1) = V_{\tilde{x}}(k-m+1,k-\ell+1/k)
 - K_{k+1}^{m}\Big[E_k\Big\{\sum_{i,j=0}^{L} \tilde{d}_i[x(k-i)/k]\,\tilde{f}_j^T[x(k-\ell-j)/k]\Big\}
 + H(k+1)\,G[x(k),k]\,\Psi_w(k)\,\delta_{k,k-\ell}\,G^T[x(k-\ell),k-\ell]
 + \Psi_{vw}(k-\ell)\,\delta_{k,k-\ell}\,G^T[x(k-\ell),k-\ell]\Big] \tag{6.2.17}$$

where

$$V_{\tilde{x}}(k-m+1,k-\ell+1/k) = E_k\Big\{\sum_{i,j=0}^{L} \tilde{f}_i[x(k-m-i)/k]\,\tilde{f}_j^T[x(k-\ell-j)/k]\Big\}
 + G[x(k-m),k-m]\,\Psi_w(k-m)\,\delta_{k-m,k-\ell}\,G^T[x(k-\ell),k-\ell] \tag{6.2.18}$$

Here, Equations (6.2.12), (6.2.15) - (6.2.18) form the
recursive smoothing algorithm starting with initial condition $\hat{x}(-\ell/-1)$
for nonnegative integer values of m and $\ell$. The above equations can
be modified to give

$$\hat{x}(-\ell/0) = \hat{x}(-\ell/-1) + K_0^{\ell}\,[y(0) - H(0)\,\hat{x}(0/-1)]$$

and

$$V_{\tilde{x}}(-\ell,-\ell/0) = V_{\tilde{x}}(-\ell,-\ell/-1) - K_0^{\ell}\,H(0)\,V_{\tilde{x}}(0,-\ell/-1)$$

where

$$K_0^{\ell} = V_{\tilde{x}}(-\ell,0/-1)\,H^T(0)\,[H(0)\,V_{\tilde{x}}(0,0/-1)\,H^T(0) + \Psi_{\gamma}(0)]^{-1}$$
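The initial update above is an ordinary linear-Gaussian correction and can be sketched directly. The function name, argument names and the numbers in the usage lines are hypothetical; `R0` stands for the covariance of the measurement noise at k = 0, and the sketch uses the fact that the cross-covariance with its arguments swapped is the transpose.

```python
import numpy as np


def initial_smoother_update(x_l_prior, x0_prior, V_l0, V_00, V_ll, H0, R0, y0):
    """One initial correction of the lagged estimate using y(0).

    x_l_prior : prior estimate of x(-l)
    x0_prior  : prior estimate of x(0)
    V_l0      : prior cross-covariance of the errors in x(-l) and x(0)
    V_00      : prior error covariance of x(0)
    V_ll      : prior error covariance of x(-l)
    """
    innovation_cov = H0 @ V_00 @ H0.T + R0
    K0 = V_l0 @ H0.T @ np.linalg.inv(innovation_cov)
    x_l_post = x_l_prior + K0 @ (y0 - H0 @ x0_prior)
    # V(0, -l / -1) is the transpose of V(-l, 0 / -1).
    V_ll_post = V_ll - K0 @ H0 @ V_l0.T
    return K0, x_l_post, V_ll_post


# Scalar usage with made-up numbers.
K0, x_post, V_post = initial_smoother_update(
    x_l_prior=np.array([1.0]), x0_prior=np.array([0.0]),
    V_l0=np.array([[1.0]]), V_00=np.array([[2.0]]), V_ll=np.array([[3.0]]),
    H0=np.array([[1.0]]), R0=np.array([[2.0]]), y0=np.array([4.0]),
)
```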

Notice that in the above derivation the results obtained are
exact and optimal with respect to the constraints of Equation (6.2.8),
minimizing the error-variance cost functional. However, in the case
of nonlinear systems, the preceding algorithms require infinite
dimensional systems to realize the various terms involving the
expectations.

In order that the smoother can be physically realized, the
conditional probability density functions of the smoothing errors are
assumed to be Gaussian, and Equation (6.2.9) can be used to substitute
for the state variables. This is particularly significant for systems
with product-type or polynomial nonlinearities, since in these cases
the smoothing algorithms can be evaluated without much difficulty and
are physically realizable without any further need of approximation.

It must be noted that the results presented in this chapter
are new. The smoothing algorithms can be implemented in computer
evaluation, and the computations are expected to be efficient, since no
augmentation of the state variables is involved. Another very significant
contribution of the presented derivation is the fact that all three
cases of the smoothing problem and the filtering estimation follow
immediately from the above results, with the proper choice of the discrete
time indices k and $\ell$. They are given as follows:


                                  Replacement    Replacement
                                  for k          for ℓ

The filtering estimation          k              0
The fixed-lag smoothing           k-1            N
The fixed-point smoothing         k              k-N+1
The fixed-interval smoothing      N-1            N-k
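The effect of each substitution can be checked mechanically. The helper below is a hypothetical illustration, not part of the thesis. For each problem type it applies the replacements from the table and reports which state is estimated (index k-ℓ+1) and which measurement the estimate is conditioned on (index k+1).

```python
def index_substitution(problem, k, N):
    """Apply the table's replacements for k and l in x_hat(k-l+1 / k+1).

    Returns (k_new, lag, estimated_index, conditioned_on_index).
    """
    table = {
        "filtering": (k, 0),
        "fixed-lag": (k - 1, N),
        "fixed-point": (k, k - N + 1),
        "fixed-interval": (N - 1, N - k),
    }
    k_new, lag = table[problem]
    estimated_index = k_new - lag + 1      # time index of the estimated state
    conditioned_on_index = k_new + 1       # index of the latest measurement used
    return k_new, lag, estimated_index, conditioned_on_index
```

For example, `index_substitution("fixed-lag", 10, 3)` gives an estimate of x(7) conditioned on measurements up to y(10), that is, the form x̂(k-N/k) used in Section 6.2.3.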


It  is  clear  that  a  unified  approach  has  been  developed  to 
derive  the  three  different  classifications  of  smoothers  and  the 
filtering  algorithms  for  discrete  nonlinear  delayed  systems  involving 
colored  noise.  The  derivation  clearly  shows  the  close  link  between 
the  various  smoothers  and  filter. 

6.2.3  The  Nonlinear  Fixed-Lag  Smoothing 

In this section, the nonlinear fixed-lag smoother and its
error-variance equations are presented. They follow immediately from
Equations (6.2.12), (6.2.15) - (6.2.18), with substitution of k and
N for k+1 and $\ell$, respectively. The results are:

$$\hat{x}(k-N/k) = \hat{x}(k-N/k-1) + K_k^{N}\Big\{y(k) - H(k)\,\hat{x}(k/k-1)
 - A(k,k-1)\,[y(k-1) - H(k-1)\,\hat{x}(k-1)]\Big\}$$

$$\hat{x}(k-N/k-1) = \sum_{j=0}^{L} \hat{f}_j[x(k-N-j-1),k-N-j-1/k-1]$$

$$K_k^{N} = \Big[E_{k-1}\Big\{\sum_{i,j=0}^{L} \tilde{f}_i[x(k-N-i-1)/k-1]\,\tilde{d}_j^T[x(k-1-j)/k-1]\Big\}
 + G[x(k-1-N),k-1-N]\,\Psi_w(k-1-N)\,\delta_{k-1-N,k-1}\,G^T[x(k-1),k-1]\,H^T(k)
 + G[x(k-1-N),k-1-N]\,\Psi_{wv}(k-1-N)\,\delta_{k-1-N,k-1}\Big]$$
$$\quad\cdot\Big[\Psi_v(k-1) + H(k)\,G[x(k-1),k-1]\,\Psi_w(k-1)\,G^T[x(k-1),k-1]\,H^T(k)
 + E_{k-1}\Big\{\sum_{i,j=0}^{L} \tilde{d}_i[x(k-1-i)/k-1]\,\tilde{d}_j^T[x(k-1-j)/k-1]\Big\}
 + H(k)\,G[x(k-1),k-1]\,\Psi_{wv}(k-1) + \Psi_{vw}(k-1)\,G^T[x(k-1),k-1]\,H^T(k)\Big]^{-1}$$

$$V_{\tilde{x}}(k-N/k) = V_{\tilde{x}}(k-N/k-1) - K_k^{N}\Big[E_{k-1}\Big\{\sum_{i,j=0}^{L} \tilde{d}_i[x(k-1-i)/k-1]\,\tilde{f}_j^T[x(k-1-N-j)/k-1]\Big\}
 + H(k)\,G[x(k-1),k-1]\,\Psi_w(k-1)\,\delta_{k-1,k-1-N}\,G^T[x(k-1-N),k-1-N]
 + \Psi_{vw}(k-1-N)\,\delta_{k-1,k-1-N}\,G^T[x(k-1-N),k-1-N]\Big]$$

and

$$V_{\tilde{x}}(k-N/k-1) = E_{k-1}\Big\{\sum_{i,j=0}^{L} \tilde{f}_i[x(k-1-N-i)/k-1]\,\tilde{f}_j^T[x(k-1-N-j)/k-1]\Big\}
 + G[x(k-1-N),k-1-N]\,\Psi_w(k-1-N)\,G^T[x(k-1-N),k-1-N]$$

6.2.4  The  Nonlinear  Fixed-Point  Smoothing 

In this section a nonlinear fixed-point smoother is
obtained from Equations (6.2.12), (6.2.15) - (6.2.18), with k-N+1
substituted for $\ell$. Then

$$\hat{x}(N/k+1) = \hat{x}(N/k) + K_{k+1}^{k-N+1}\Big\{y(k+1) - H(k+1)\,\hat{x}(k+1/k)
 - A(k+1,k)\,[y(k) - H(k)\,\hat{x}(k)]\Big\}$$

$$\hat{x}(N/k) = \sum_{j=0}^{L} \hat{f}_j[x(N-1-j),N-1-j/k]$$

$$K_{k+1}^{k-N+1} = \Big[E_k\Big\{\sum_{i,j=0}^{L} \tilde{f}_i[x(N-1-i)/k]\,\tilde{d}_j^T[x(k-j)/k]\Big\}
 + G[x(N-1),N-1]\,\Psi_w(N-1)\,\delta_{N-1,k}\,G^T[x(k),k]\,H^T(k+1)
 + G[x(N-1),N-1]\,\Psi_{wv}(N-1)\,\delta_{N-1,k}\Big]$$
$$\quad\cdot\Big[\Psi_v(k) + H(k+1)\,G[x(k),k]\,\Psi_w(k)\,G^T[x(k),k]\,H^T(k+1)
 + E_k\Big\{\sum_{i,j=0}^{L} \tilde{d}_i[x(k-i)/k]\,\tilde{d}_j^T[x(k-j)/k]\Big\}
 + H(k+1)\,G[x(k),k]\,\Psi_{wv}(k) + \Psi_{vw}(k)\,G^T[x(k),k]\,H^T(k+1)\Big]^{-1}$$

$$V_{\tilde{x}}(N/k+1) = V_{\tilde{x}}(N/k) - K_{k+1}^{k-N+1}\Big[E_k\Big\{\sum_{i,j=0}^{L} \tilde{d}_i[x(k-i)/k]\,\tilde{f}_j^T[x(N-1-j)/k]\Big\}
 + H(k+1)\,G[x(k),k]\,\Psi_w(k)\,\delta_{k,N-1}\,G^T[x(N-1),N-1]
 + \Psi_{vw}(N-1)\,\delta_{k,N-1}\,G^T[x(N-1),N-1]\Big]$$

$$V_{\tilde{x}}(N/k) = E_k\Big\{\sum_{i,j=0}^{L} \tilde{f}_i[x(N-1-i)/k]\,\tilde{f}_j^T[x(N-1-j)/k]\Big\}
 + G[x(N-1),N-1]\,\Psi_w(N-1)\,G^T[x(N-1),N-1]$$

6.2.5  Estimation in Linear Discrete Systems

The estimation problem of linear discrete systems can be
considered as a special case of the nonlinear estimation problems
satisfying the relations

$$\sum_{j=0}^{L} f_j[x(k-j),k-j] = \sum_{j=0}^{L} F_j(k)\,x(k-j)$$

$$G[x(k),k] = G(k)$$

From Equations (6.2.12), (6.2.15) - (6.2.18), the resulting
linear estimation algorithms are as follows:

x(k-£+l/k+l)  =  x(k-£+l/k)  +  K^+1(y(k+l) 

-  H(k+1 )  x(k+l/k)  -  A(k+1  , k ) [y (k)-H(k)  x(k)]} 
L 

where  x(k-£+l/k)  =  l  F.(k-t)  x(k-£-j/k) 

j=0  J 

K*+1  =  [|  F^k-i)  Vjk-n-i.k-j/k)  F j ( k ) 

T  L  J 

•  HT(k+l)  +  l  F.(k-t)  Vjk-£-i,k/k)  H  (k) 

1=0  x 

•  AT(k+l,k)  +  G(k-0'fw(k-£)«k.JjkGT(k-j)HT(k+l) 


x<? 
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and 


+  G<k'£)  ¥wv<k-£)  V*,k] 

'  l>v(k)  +  H(k+l)G(k)  fw(k)  GT(k)  HT ( k+1 ) 

L  T  T 

+  H(k+1)  l  F.(k)  V  (k-i,k-j/k)  F  (k)  H  (k+1 ) 

i  ,j=o  x  J 

L  T  T 

-  H (k+1 )  l  F . ( k )  V  (k-i ,k/k)  H1 (k)  A1 (k+1) 

1=0  X 

L  T  T 

-  I  A(k+1)  H(k)  v  (k.k-j/k)  F  (k)  H1 (k+1) 

j  =  0  X  J 

+  A(k+1 )  H(k)  Vjk/k)  HT(k)  AT(k+l) 

x 

+  H ( k+1 )  G(k)  Ywv(k)  +  Tvw(k)  GT(k)  HT ( k+1 )]-1 


V  (k-a+l/k+1 )  =  VJk-n+l/k)  -  K£Q  H(k+1)  F.(k) 

X  i,j=0 


•  V  (k-i ,k-ii-j/k)  fT ( k- A )  +  T  A(k+l,k)  H(k) 
«  J  ^  _ 


1=0 


•  V  (k,k-4-j/k)  FT(k-jt) 

x  3 

T 

+  H ( k+1 )  G(k-n)  6k,k-4',,w(k-)t)G(k-Ji) 
+  GT(k-^)  «kik.t] 
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L 

where,  Vjk-£+l/k)  =  £  F.(k-£)  Vjk-£-i  ,k-£-j/k) 

x  i,j=o  1  x 

*  fT  ( k-£ )  +  G(k-£)  Y  ( k  -  £ )  GT(k-£) 

The three different classifications of smoothers and the
filtering algorithms can be obtained from the preceding algorithms
with substitutions for k and $\ell$ identical to those of Section 6.2.2.

In the particular case of a linear message model without
delays, the results presented above agree well with algorithms
developed in the literature [6,7]. For the case of a linear message
model with delays, the results presented for the fixed-lag smoother and
the fixed-point smoother can be identified with those presented by Biswas
and Mahalanabis [55,61].
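For the delay-free linear case, one filtering step can be sketched by specialising Equations (6.2.12) and (6.2.16) - (6.2.18) to L = 0 and lag zero. This is an illustrative reading under those assumptions, not code from the thesis; the function names and the numbers are hypothetical, and `D` is shorthand for H(k+1)F(k) - A(k+1,k)H(k). As a sanity check, with A = 0 and uncorrelated noises the step should coincide with an ordinary Kalman predict-and-update step, which the second function provides for comparison.

```python
import numpy as np


def colored_noise_filter_step(x_hat, P, y_next, y_now, F, G, H_next, H_now,
                              A, Qw, Rv, Swv):
    """One filtering step by measurement differencing (no delays, lag 0).

    Qw, Rv are the covariances of w and v; Swv = E{w v^T}.
    """
    D = H_next @ F - A @ H_now
    innovation = y_next - A @ y_now - D @ x_hat
    cross = F @ P @ D.T + G @ Qw @ G.T @ H_next.T + G @ Swv
    innovation_cov = (Rv + H_next @ G @ Qw @ G.T @ H_next.T + D @ P @ D.T
                      + H_next @ G @ Swv + Swv.T @ G.T @ H_next.T)
    K = cross @ np.linalg.inv(innovation_cov)
    x_new = F @ x_hat + K @ innovation
    P_pred = F @ P @ F.T + G @ Qw @ G.T
    P_new = P_pred - K @ cross.T
    return x_new, P_new


def standard_kalman_step(x_hat, P, y_next, F, G, H, Qw, Rv):
    """Textbook predict-then-update step, used only as a reference."""
    x_pred = F @ x_hat
    P_pred = F @ P @ F.T + G @ Qw @ G.T
    K = P_pred @ H.T @ np.linalg.inv(H @ P_pred @ H.T + Rv)
    return x_pred + K @ (y_next - H @ x_pred), P_pred - K @ H @ P_pred


# Hypothetical two-state example with white measurement noise (A = 0).
F = np.array([[1.0, 0.1], [0.0, 1.0]])
G = np.array([[0.0], [1.0]])
H = np.array([[1.0, 0.0]])
Qw, Rv = np.array([[0.5]]), np.array([[0.2]])
zero = np.zeros((1, 1))
x0, P0 = np.array([0.0, 1.0]), np.eye(2)
y_now, y_next = np.array([0.0]), np.array([0.3])

x_a, P_a = colored_noise_filter_step(x0, P0, y_next, y_now, F, G, H, H,
                                     zero, Qw, Rv, zero)
x_b, P_b = standard_kalman_step(x0, P0, y_next, F, G, H, Qw, Rv)
```

Note that the state dimension stays at n throughout; the colored noise enters only through `D` and the differenced innovation.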

6.3  NONLINEAR DISCRETE DELAYED SYSTEMS WITH MEASUREMENTS CONTAINING
MULTIPLE DELAYS IMBEDDED IN COLORED NOISE

This section derives nonlinear estimation algorithms for
nonlinear discrete delayed systems with measurements containing
multiple time delays, corrupted by colored noise. The results are new,
recursive in nature and directly yield the three different classifications
of smoothers and the filtering estimator.

Section 6.3.1 presents the statement of the problem. In
Section 6.3.2, the nonlinear smoother is derived. Sections 6.3.3, 6.3.4
and 6.3.5, respectively, present the nonlinear fixed-lag smoother, the
fixed-point smoother and the filtering algorithms.

General linear estimation algorithms are presented in
Section 6.3.6. The results can be directly applied to the fixed-lag,
fixed-point and fixed-interval smoothers and to the filtering estimator.
The results obtained here for linear discrete systems are also new.

6.3.1  The  Problem  Statement 

In this section, the discrete nonlinear time-delayed
message model is described by Equation (6.2.1), the measurement noise
sequence $\gamma(k)$ is modelled by Equation (6.2.3), and the measurement
model is given by

$$y(k) = \sum_{j=0}^{M} H_j(k)\,x(k-j) + \gamma(k) \tag{6.3.1}$$

Here, $H_j$ is an m×n matrix. All initial conditions and
prior statistics are the same as those of Section 6.2.1. The smoothing
estimation problem is also identical to that of Section 6.2.1.

6.3.2  The  Derivation  of  the  Smoother 

Examination of Equations (6.3.1) and (6.2.3) indicates that a
modified measurement signal z(k+1) containing an additive white noise
sequence can be obtained by multiplying y(k) by A(k+1,k) and
subtracting it from y(k+1), that is,

$$z(k+1) = y(k+1) - A(k+1,k)\,y(k) \tag{6.3.2}$$

Notice that subtracting a function of the previous observation in no
way affects the information in the present observation, and hence the
estimate conditioned on Y(k+1) is equivalent to that conditioned on Z(k+1).

Equation (6.3.2) can be further simplified as

$$z(k+1) = \sum_{i=0}^{L}\sum_{j=0}^{M} d_{i,j}[x(k-j-i),k-j-i]
 + \sum_{j=0}^{M} H_j(k+1)\,G[x(k-j),k-j]\,w(k-j) + v(k) \tag{6.3.3}$$

where

$$\sum_{i=0}^{L}\sum_{j=0}^{M} d_{i,j}[x(k-j-i),k-j-i] = \sum_{i=0}^{L}\sum_{j=0}^{M} H_j(k+1)\,f_i[x(k-j-i),k-j-i]
 - A(k+1,k)\sum_{j=0}^{M} H_j(k)\,x(k-j) \tag{6.3.4}$$

It is assumed that the smoothed estimate is constrained by
the discrete nonlinear dynamic equation

$$\hat{x}(k-\ell+1/k+1) = \sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),k-\ell-j] + K_{k+1}^{\ell}\,z(k+1) \tag{6.3.5}$$

and since it is required that the smoothed estimate be unbiased, the
following relation is obtained

$$\sum_{j=0}^{L} b_j[\hat{x}(k-\ell-j/k),k-\ell-j] = \sum_{j=0}^{L} \hat{f}_j[x(k-\ell-j),k-\ell-j/k]
 - K_{k+1}^{\ell}\sum_{i=0}^{L}\sum_{j=0}^{M} \hat{d}_{i,j}[x(k-j-i),k-j-i/k]$$

where

$$\sum_{j=0}^{L} \hat{f}_j[x(k-\ell-j),k-\ell-j/k] = E\Big\{\sum_{j=0}^{L} f_j[x(k-\ell-j),k-\ell-j]\,/\,Z(k)\Big\}$$

Also

$$\sum_{i=0}^{L}\sum_{j=0}^{M} \hat{d}_{i,j}[x(k-j-i),k-j-i/k] = E\Big\{\sum_{i=0}^{L}\sum_{j=0}^{M} d_{i,j}[x(k-j-i),k-j-i]\,/\,Z(k)\Big\} \tag{6.3.6}$$


On the basis of Equation (6.2.1), the following identity is
obtained

$$\hat{x}(k-\ell+1/k) = \sum_{j=0}^{L} \hat{f}_j[x(k-\ell-j),k-\ell-j/k] \tag{6.3.7}$$

then Equation (6.3.5) can be simplified as

$$\hat{x}(k-\ell+1/k+1) = \hat{x}(k-\ell+1/k) + K_{k+1}^{\ell}\Big\{z(k+1)
 - \sum_{i=0}^{L}\sum_{j=0}^{M} \hat{d}_{i,j}[x(k-j-i),k-j-i/k]\Big\} \tag{6.3.8}$$

Substituting Equation (6.3.2) into (6.3.8), and using the
identities of Equations (6.3.4) and (6.3.6), the smoothing error takes
the form

$$\tilde{x}(k-\ell+1/k+1) = \tilde{x}(k-\ell+1/k) - K_{k+1}^{\ell}\Big\{\sum_{j=0}^{M} H_j(k+1)\,\tilde{x}(k-j+1/k)
 - A(k+1,k)\sum_{j=0}^{M} H_j(k)\,\tilde{x}(k-j/k) + v(k)\Big\} \tag{6.3.9}$$

j=o  J 

Now the error-variance equation for $V_{\tilde{x}}(k-\ell+1/k+1)$ can be
obtained by taking the expectation of Equation (6.3.9) multiplied by
its own transpose. Then the optimal value of the matrix $K_{k+1}^{\ell}$ can
be obtained by setting the gradient of Equation (6.2.5) equal to the
null matrix.

As a result, the following relations are found

$$K_{k+1}^{\ell} = \Big\{\sum_{j=0}^{M} V_{\tilde{x}}(k-\ell+1,k-j+1/k)\,H_j^T(k+1)
 - \sum_{j=0}^{M} V_{\tilde{x}}(k-\ell+1,k-j/k)\,H_j^T(k)\,A^T(k+1,k)
 + G[x(k-\ell),k-\ell]\,\Psi_{wv}(k-\ell)\,\delta_{k-\ell,k}\Big\}$$
$$\quad\cdot\Big\{\sum_{i,j=0}^{M} H_i(k+1)\,V_{\tilde{x}}(k-i+1,k-j+1/k)\,H_j^T(k+1)
 + \sum_{i,j=0}^{M} A(k+1,k)\,H_i(k)\,V_{\tilde{x}}(k-i,k-j/k)\,H_j^T(k)\,A^T(k+1,k)$$
$$\quad - \sum_{i,j=0}^{M} H_i(k+1)\,V_{\tilde{x}}(k-i+1,k-j/k)\,H_j^T(k)\,A^T(k+1,k)
 - \sum_{i,j=0}^{M} A(k+1,k)\,H_i(k)\,V_{\tilde{x}}(k-i,k-j+1/k)\,H_j^T(k+1)$$
$$\quad + \Psi_v(k) + \sum_{i=0}^{M} H_i(k+1)\,G[x(k-i),k-i]\,\Psi_{wv}(k-i)\,\delta_{k-i,k}
 + \sum_{j=0}^{M} \Psi_{vw}(k-j)\,G^T[x(k-j),k-j]\,H_j^T(k+1)\,\delta_{k-j,k}\Big\}^{-1} \tag{6.3.10}$$

and

$$V_{\tilde{x}}(k-m+1,k-\ell+1/k+1) = V_{\tilde{x}}(k-m+1,k-\ell+1/k)
 - K_{k+1}^{m}\Big\{\sum_{j=0}^{M} H_j(k+1)\,V_{\tilde{x}}(k-j+1,k-\ell+1/k)
 - \sum_{j=0}^{M} A(k+1,k)\,H_j(k)\,V_{\tilde{x}}(k-j,k-\ell+1/k)
 + \Psi_{vw}(k-\ell)\,\delta_{k-\ell,k}\,G^T[x(k-\ell),k-\ell]\Big\} \tag{6.3.11}$$


Furthermore, from Equations (6.2.1) and (6.3.7), one can obtain

$$\tilde{x}(k-\ell+1/k) = \sum_{j=0}^{L} \tilde{f}_j[x(k-\ell-j),k-\ell-j/k] + G[x(k-\ell),k-\ell]\,w(k-\ell) \tag{6.3.12}$$

therefore,

$$V_{\tilde{x}}(k-m+1,k-\ell+1/k) = \sum_{i,j=0}^{L} E_k\{\tilde{f}_i[x(k-m-i),k-m-i/k]\,\tilde{f}_j^T[x(k-\ell-j),k-\ell-j/k]\}
 + G[x(k-m),k-m]\,\Psi_w(k-m)\,\delta_{k-m,k-\ell}\,G^T[x(k-\ell),k-\ell] \tag{6.3.13}$$


and 


(6.3.14) 


x  ( k- £/ k ) } 


for all non-negative integers m and $\ell$ of interest. Note that
$V_{\tilde{x}}(k-\ell+1/k+1)$ and $V_{\tilde{x}}(k-\ell+1/k)$ can be easily obtained from
Equations (6.3.11) and (6.3.13) with m set equal to $\ell$.

It should be noted that the above estimation algorithms are
very general: they can be applied to all three cases of smoothing as
well as to the filtering estimation problem. The results obtained
are exact, new and optimal with respect to the constraining Equation
(6.3.5), minimizing the error-variance cost functional of Equation
(6.2.5).
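
As a scalar illustration of why a zero gradient of the error-variance cost leads to a gain of the form (cross-covariance) times (innovation covariance) inverse, as in Equation (6.3.10), the following minimal sketch may help. It is not the algorithm of this chapter: the function names and the numbers are hypothetical, and the scalars `P`, `C` and `S` merely stand in for the prior error variance, the first braced factor and the second braced factor of the gain.

```python
def optimal_gain(cross_cov: float, innov_var: float) -> float:
    """Gain K that makes dJ/dK = 0 for J(K) = P - 2*K*C + K*K*S (scalar case)."""
    if innov_var <= 0.0:
        raise ValueError("innovation variance must be positive")
    return cross_cov / innov_var


def error_variance(prior_var: float, cross_cov: float,
                   innov_var: float, gain: float) -> float:
    """Variance of (prior error - gain * innovation) for scalar quantities."""
    return prior_var - 2.0 * gain * cross_cov + gain * gain * innov_var


# Hypothetical numbers, chosen only for illustration.
P, C, S = 2.0, 0.8, 1.6
K = optimal_gain(C, S)
J_opt = error_variance(P, C, S, K)
```

Any other gain gives a larger error variance, which is the scalar counterpart of the minimum-variance property claimed for the matrix gain.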

However, in the case of nonlinear systems, the preceding
algorithms require infinite-dimensional systems to realize the
various terms involving the expectations.

For practical realization, it can be assumed that the
conditional probability density functions of the estimator errors are
Gaussian. Under such an assumption, for systems with product-type or
polynomial nonlinearities, the presented algorithms can be physically
realized without any further approximation. The computer
evaluation is expected to be efficient, since no augmentation of
state variables is involved. Other significant advantages of the
above technique are that the derivation is straightforward
and easy to apply to various other estimation problems, that the results
clearly show the close link between the various smoothers and filters,
and that the algorithms can be computed recursively starting with the
initial conditions $\hat{x}(-\ell/-1)$ and $V_{\tilde{x}}(-m,-\ell/-1)$ for
non-negative values of $m$ and $\ell$.
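
To illustrate why polynomial nonlinearities need no further approximation once the Gaussian assumption is made, the sketch below evaluates the expectation of a scalar polynomial in closed form. It is only an illustration under stated assumptions: the state is taken to be scalar and Gaussian with known mean and variance, and the function names `gaussian_raw_moments` and `expected_polynomial` are hypothetical, not part of the thesis. It relies on the standard moment recursion E{x^n} = m E{x^(n-1)} + (n-1) P E{x^(n-2)} for x ~ N(m, P).

```python
from typing import List, Sequence


def gaussian_raw_moments(mean: float, var: float, order: int) -> List[float]:
    """Return [E{x^0}, ..., E{x^order}] for a scalar x ~ N(mean, var)."""
    if order < 0:
        raise ValueError("order must be non-negative")
    if var < 0.0:
        raise ValueError("variance must be non-negative")
    moments = [1.0]
    if order >= 1:
        moments.append(mean)
    for n in range(2, order + 1):
        # Gaussian moment recursion (a consequence of Stein's lemma).
        moments.append(mean * moments[n - 1] + (n - 1) * var * moments[n - 2])
    return moments


def expected_polynomial(coeffs: Sequence[float], mean: float, var: float) -> float:
    """E{ sum_n coeffs[n] * x**n } for a scalar x ~ N(mean, var)."""
    if len(coeffs) == 0:
        return 0.0
    moments = gaussian_raw_moments(mean, var, len(coeffs) - 1)
    return sum(c * m for c, m in zip(coeffs, moments))


# Hypothetical example: E{x^2} and E{1 + x^3} for x ~ N(1, 0.5).
second_moment = expected_polynomial([0.0, 0.0, 1.0], 1.0, 0.5)
cubic_mean = expected_polynomial([1.0, 0.0, 0.0, 1.0], 1.0, 0.5)
```

The same idea extends to vector states and product-type terms, where the required expectations are again finite combinations of means and covariances.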

6.3.3  The  Nonlinear  Fixed-Lag  Smoothing 

With the substitution of $k$ for $k+1$ and $\ell = N$, where $N$ is a fixed
integer and $k$ ($> N$) is variable, Equations (6.3.7), (6.3.8), (6.3.10),
(6.3.13) and (6.3.14) yield the discrete fixed-lag smoothing algorithms
for nonlinear time-delayed systems with colored noise. They are


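$$
\hat{x}(k-N/k-1) = \sum_{j=0}^{L} \hat{f}_j[x(k-1-N-j),\,k-1-N-j/k-1]
$$

$$
\hat{x}(k-N/k) = \hat{x}(k-N/k-1) + K^{N}_{k} \Big\{ z(k) - \sum_{i=0}^{L} \sum_{j=0}^{M} \hat{d}_{i,j}[x(k-1-j-i),\,k-1-j-i/k-1] \Big\}
$$

$$
\begin{aligned}
K^{N}_{k} = {} & \Big\{ \sum_{j=0}^{M} V_{\tilde{x}}(k-N,\,k-j/k-1)\, H_j^{T}(k) \\
& - \sum_{j=0}^{M} V_{\tilde{x}}(k-N,\,k-j-1/k-1)\, H_j^{T}(k-1)\, A^{T}(k,k-1) \\
& + G[x(k-N-1),k-N-1]\, \Psi_{wv}(k-N-1)\, \delta_{k-N-1,k} \Big\} \\
& \cdot \Big\{ \sum_{i,j=0}^{M} H_i(k)\, V_{\tilde{x}}(k-i,\,k-j/k-1)\, H_j^{T}(k) \\
& + \sum_{i,j=0}^{M} A(k,k-1)\, H_i(k-1)\, V_{\tilde{x}}(k-i-1,\,k-j-1/k-1)\, H_j^{T}(k-1)\, A^{T}(k,k-1) \\
& - \sum_{i,j=0}^{M} H_i(k)\, V_{\tilde{x}}(k-i,\,k-j-1/k-1)\, H_j^{T}(k-1)\, A^{T}(k,k-1) \\
& - \sum_{i,j=0}^{M} A(k,k-1)\, H_i(k-1)\, V_{\tilde{x}}(k-i-1,\,k-j/k-1)\, H_j^{T}(k) \\
& + \Psi_{v}(k-1) + \sum_{i=0}^{M} H_i(k)\, G[x(k-1-i),k-1-i]\, \Psi_{wv}(k-1-i)\, \delta_{k-1-i,k-1} \\
& + \sum_{j=0}^{M} \Psi_{vw}(k-1-j)\, G^{T}[x(k-1-j),k-1-j]\, H_j^{T}(k)\, \delta_{k-1-j,k-1} \Big\}^{-1}
\end{aligned}
$$

$$
\begin{aligned}
V_{\tilde{x}}(k-N/k) = {} & V_{\tilde{x}}(k-N/k-1) - K^{N}_{k} \Big\{ \sum_{j=0}^{M} H_j(k)\, V_{\tilde{x}}(k-j,\,k-N/k-1) \\
& - \sum_{j=0}^{M} A(k,k-1)\, H_j(k-1)\, V_{\tilde{x}}(k-j-1,\,k-N/k-1) \\
& + \Psi_{vw}(k-N-1)\, \delta_{k-N-1,k}\, G^{T}[x(k-N-1),k-N-1] \Big\}
\end{aligned}
$$

$$
\begin{aligned}
V_{\tilde{x}}(k-N/k-1) = {} & \sum_{i,j=0}^{L} E_{k-1}\big\{ \tilde{f}_i[x(k-1-N-i),\,k-1-N-i/k-1]\; \tilde{f}_j^{T}[x(k-1-N-j),\,k-1-N-j/k-1] \big\} \\
& + G[x(k-1-N),k-1-N]\, \Psi_{w}(k-1-N)\, G^{T}[x(k-1-N),k-1-N]
\end{aligned}
$$

$$
V_{\tilde{x}}(k-N,\,k-i-1/k-1) = \sum_{j=0}^{L} E_{k-1}\big\{ \tilde{f}_j[x(k-N-j-1),\,k-N-j-1/k-1]\; \tilde{x}^{T}(k-i-1/k-1) \big\}
$$
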


Here $V_{\tilde{x}}(k+1/k)$ can be obtained directly from Equation (6.3.13)
with the substitution $m = \ell = 0$, and $V_{\tilde{x}}(k-i+1,\,k-j+1/k)$ is
the same as that of Equation (6.3.13).

6.3.4  The  Nonlinear  Fixed-Point  Smoothing 

If $\ell = k-N+1$ is substituted into Equations (6.3.7), (6.3.8),
(6.3.10) - (6.3.11) and (6.3.13) - (6.3.14), where $k+1 > N$ and $N$
is fixed, then these equations represent the fixed-point smoothing
algorithms for nonlinear time-delayed systems with colored noise.

These equations are


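$$
\hat{x}(N/k) = \sum_{j=0}^{L} \hat{f}_j[x(N-1-j),\,N-1-j/k]
$$

$$
\hat{x}(N/k+1) = \hat{x}(N/k) + K^{k-N+1}_{k+1} \Big\{ z(k+1) - \sum_{i=0}^{L} \sum_{j=0}^{M} \hat{d}_{i,j}[x(k-j-i),\,k-j-i/k] \Big\}
$$

$$
\begin{aligned}
K^{k-N+1}_{k+1} = {} & \Big\{ \sum_{j=0}^{M} V_{\tilde{x}}(N,\,k-j+1/k)\, H_j^{T}(k+1) \\
& - \sum_{j=0}^{M} V_{\tilde{x}}(N,\,k-j/k)\, H_j^{T}(k)\, A^{T}(k+1,k)
  + G[x(N-1),N-1]\, \Psi_{wv}(N-1)\, \delta_{N-1,k} \Big\} \\
& \cdot \Big\{ \sum_{i,j=0}^{M} H_i(k+1)\, V_{\tilde{x}}(k-i+1,\,k-j+1/k)\, H_j^{T}(k+1) \\
& + \sum_{i,j=0}^{M} A(k+1,k)\, H_i(k)\, V_{\tilde{x}}(k-i,\,k-j/k)\, H_j^{T}(k)\, A^{T}(k+1,k) \\
& - \sum_{i,j=0}^{M} H_i(k+1)\, V_{\tilde{x}}(k-i+1,\,k-j/k)\, H_j^{T}(k)\, A^{T}(k+1,k) \\
& - \sum_{i,j=0}^{M} A(k+1,k)\, H_i(k)\, V_{\tilde{x}}(k-i,\,k-j+1/k)\, H_j^{T}(k+1) \\
& + \Psi_{v}(k) + \sum_{i=0}^{M} H_i(k+1)\, G[x(k-i),k-i]\, \Psi_{wv}(k-i)\, \delta_{k-i,k} \\
& + \sum_{j=0}^{M} \Psi_{vw}(k-j)\, G^{T}[x(k-j),k-j]\, H_j^{T}(k+1)\, \delta_{k-j,k} \Big\}^{-1}
\end{aligned}
$$

$$
\begin{aligned}
V_{\tilde{x}}(N/k+1) = {} & V_{\tilde{x}}(N/k) - K^{k-N+1}_{k+1} \Big\{ \sum_{j=0}^{M} H_j(k+1)\, V_{\tilde{x}}(k-j+1,\,N/k) \\
& - \sum_{j=0}^{M} A(k+1,k)\, H_j(k)\, V_{\tilde{x}}(k-j,\,N/k)
  + \Psi_{vw}(N-1)\, \delta_{N-1,k}\, G^{T}[x(N-1),N-1] \Big\}
\end{aligned}
$$

where

$$
V_{\tilde{x}}(N,\,k-i/k) = \sum_{j=0}^{L} E_k\big\{ \tilde{f}_j[x(N-1-j),\,N-1-j/k]\; \tilde{x}^{T}(k-i/k) \big\}
$$

Here, $V_{\tilde{x}}(k+1/k)$ and $V_{\tilde{x}}(k-i+1,\,k-j+1/k)$ are the same as those of
Section 6.3.3.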
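
To make the structure of the fixed-point recursion concrete, the sketch below specializes it to a much simpler setting than the one treated in this chapter: a scalar linear system x(k+1) = F x(k) + w(k), z(k) = H x(k) + v(k), with no delays and with mutually uncorrelated white noises of variances Q and R. Under those assumptions the smoother gain reduces to a cross-covariance divided by the innovation variance. The function name, argument names and numbers are hypothetical, and this is the standard linear fixed-point smoother rather than the nonlinear colored-noise algorithm above.

```python
from typing import List, Sequence, Tuple


def fixed_point_smoother(z: Sequence[float], N: int, F: float, H: float,
                         Q: float, R: float, x0: float, P0: float
                         ) -> List[Tuple[float, float]]:
    """Estimates of x(N) and their variances after z[N], z[N+1], ...

    x0 and P0 are the prior mean and variance of x(0) before z[0] is used.
    """
    x_pred, P_pred = x0, P0
    x_s, P_s, cross = 0.0, 0.0, 0.0
    history: List[Tuple[float, float]] = []
    for k, zk in enumerate(z):
        S = H * P_pred * H + R            # innovation variance
        innov = zk - H * x_pred           # innovation
        K_f = P_pred * H / S              # filter gain
        x_filt = x_pred + K_f * innov
        P_filt = (1.0 - K_f * H) * P_pred
        if k == N:
            # Start the smoother from the filtered estimate of x(N).
            x_s, P_s = x_filt, P_filt
            cross = F * P_filt            # cov of x(N) error with next prediction error
            history.append((x_s, P_s))
        elif k > N:
            K_s = cross * H / S           # smoother gain
            x_s += K_s * innov
            P_s -= K_s * H * cross
            cross = F * (1.0 - K_f * H) * cross
            history.append((x_s, P_s))
        x_pred = F * x_filt               # one-step prediction
        P_pred = F * P_filt * F + Q
    return history


# Hypothetical data, used only to exercise the recursion.
result = fixed_point_smoother([1.0, 1.2, 0.9, 1.1], N=1, F=0.9, H=1.0,
                              Q=0.1, R=0.5, x0=0.0, P0=1.0)
```

Each later measurement can only reduce the variance of the estimate of x(N), which is the behaviour one expects from a fixed-point smoother.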

6.3.5  The  Nonlinear  Filtering 

The nonlinear filtering algorithms are obtained from Equations
(6.3.7), (6.3.8), (6.3.10), (6.3.11), (6.3.13) and (6.3.14) with $\ell$ and $m$
set to zero. The results are:


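$$
\hat{x}(k+1/k+1) = \hat{x}(k+1/k) + K^{0}_{k+1} \Big\{ z(k+1) - \sum_{i=0}^{L} \sum_{j=0}^{M} \hat{d}_{i,j}[x(k-j-i),\,k-j-i/k] \Big\}
$$

$$
\begin{aligned}
K^{0}_{k+1} = {} & \Big\{ \sum_{j=0}^{M} V_{\tilde{x}}(k+1,\,k-j+1/k)\, H_j^{T}(k+1) \\
& - \sum_{j=0}^{M} V_{\tilde{x}}(k+1,\,k-j/k)\, H_j^{T}(k)\, A^{T}(k+1,k)
  + G[x(k),k]\, \Psi_{wv}(k) \Big\} \\
& \cdot \Big\{ \sum_{i,j=0}^{M} H_i(k+1)\, V_{\tilde{x}}(k-i+1,\,k-j+1/k)\, H_j^{T}(k+1) \\
& + \sum_{i,j=0}^{M} A(k+1,k)\, H_i(k)\, V_{\tilde{x}}(k-i,\,k-j/k)\, H_j^{T}(k)\, A^{T}(k+1,k) \\
& - \sum_{i,j=0}^{M} H_i(k+1)\, V_{\tilde{x}}(k-i+1,\,k-j/k)\, H_j^{T}(k)\, A^{T}(k+1,k) \\
& - \sum_{i,j=0}^{M} A(k+1,k)\, H_i(k)\, V_{\tilde{x}}(k-i,\,k-j+1/k)\, H_j^{T}(k+1) \\
& + \Psi_{v}(k) + \sum_{i=0}^{M} H_i(k+1)\, G[x(k-i),k-i]\, \Psi_{wv}(k-i)\, \delta_{k-i,k} \\
& + \sum_{j=0}^{M} \Psi_{vw}(k-j)\, G^{T}[x(k-j),k-j]\, H_j^{T}(k+1)\, \delta_{k-j,k} \Big\}^{-1}
\end{aligned}
$$

$$
\begin{aligned}
V_{\tilde{x}}(k+1/k+1) = {} & V_{\tilde{x}}(k+1/k) - K^{0}_{k+1} \Big\{ \sum_{j=0}^{M} H_j(k+1)\, V_{\tilde{x}}(k-j+1,\,k+1/k) \\
& - \sum_{j=0}^{M} A(k+1,k)\, H_j(k)\, V_{\tilde{x}}(k-j,\,k+1/k)
  + \Psi_{vw}(k)\, G^{T}[x(k),k] \Big\}
\end{aligned}
$$

and

$$
\begin{aligned}
V_{\tilde{x}}(k+1/k) = {} & \sum_{i,j=0}^{L} E_k\big\{ \tilde{f}_i[x(k-i),\,k-i/k]\; \tilde{f}_j^{T}[x(k-j),\,k-j/k] \big\} \\
& + G[x(k),k]\, \Psi_{w}(k)\, G^{T}[x(k),k]
\end{aligned}
$$

$$
V_{\tilde{x}}(k+1,\,k-i/k) = \sum_{j=0}^{L} E_k\big\{ \tilde{f}_j[x(k-j),\,k-j/k]\; \tilde{x}^{T}(k-i/k) \big\}
$$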
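
As a worked instance of the prediction quantities in the filtering case, the sketch below takes a hypothetical scalar system x(k+1) = a x(k)^2 + w(k) with no delays, unit noise gain, white process noise of variance q, and a Gaussian conditional density N(m, P) for x(k). These modelling choices and the function name are assumptions made for illustration only. With them, the predicted estimate, the prediction error variance and the cross-covariance with the current error follow in closed form from the Gaussian moments E{x^2} = m^2 + P, E{x^3} = m^3 + 3mP and E{x^4} = m^4 + 6m^2 P + 3P^2.

```python
from typing import Tuple


def predict_quadratic(mean: float, var: float, a: float, q: float
                      ) -> Tuple[float, float, float]:
    """Prediction step for x(k+1) = a*x(k)**2 + w(k), x(k) ~ N(mean, var).

    Returns (x_pred, V_pred, V_cross), where V_cross is the covariance
    between the prediction error and the current estimation error.
    """
    if var < 0.0 or q < 0.0:
        raise ValueError("variances must be non-negative")
    # Conditional mean of a*x^2.
    x_pred = a * (mean * mean + var)
    # Variance of a*x^2 is a^2 * (E{x^4} - E{x^2}^2) = a^2 * (4 m^2 P + 2 P^2).
    V_pred = a * a * (4.0 * mean * mean * var + 2.0 * var * var) + q
    # Cov(a*x^2, x) = a * (E{x^3} - m * E{x^2}) = 2 a m P.
    V_cross = 2.0 * a * mean * var
    return x_pred, V_pred, V_cross


# Hypothetical numbers, for illustration only.
x_pred, V_pred, V_cross = predict_quadratic(mean=1.0, var=0.5, a=1.0, q=0.1)
```

No truncation of a series expansion is involved here, which is the sense in which polynomial nonlinearities can be handled without further approximation under the Gaussian assumption.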


6.3.6  Estimation  in  Linear  Discrete  Systems 

In this section, general linear estimation algorithms are
presented. They are obtained directly from the results of Section 6.3.2
when the system satisfies the following relations:

$$
\sum_{j=0}^{L} f_j[x(k-j),\,k-j] = \sum_{j=0}^{L} F_j(k)\, x(k-j)
$$

$$
G[x(k),k] = G(k)
$$


The  results  are: 
$$
\hat{x}(k-\ell+1/k) = \sum_{j=0}^{L} F_j(k-\ell)\, \hat{x}(k-\ell-j/k)
$$

$$
\hat{x}(k-\ell+1/k+1) = \hat{x}(k-\ell+1/k) + K^{\ell}_{k+1} \{\, z(k+1) - \cdots \,\}
$$

$$
\cdots - \sum_{j=0}^{M} A(k+1,k)\, H_j(k)\, V_{\tilde{x}}(k-j,\,k-\ell+1/k) \cdots
$$

The three different classes of smoothers as well as the
filtering algorithms can be obtained from the above equations with the
following substitutions.

    Estimator                       k replaced by    ℓ replaced by
    The fixed-lag smoothing         k-1              N
    The fixed-point smoothing       k                k-N+1
    The fixed-interval smoothing    N-1              N-k
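
The index substitutions listed above can be checked mechanically. The general recursion produces the estimate of the state at time k-ℓ+1 given data up to time k+1, so each substitution fixes which estimate is obtained. The helper names below are hypothetical and serve only to verify the bookkeeping; the filtering case uses ℓ = 0, as stated in Section 6.3.5.

```python
from typing import Tuple


def estimate_times(k: int, lag: int) -> Tuple[int, int]:
    """(state time, data time) of the estimate produced by the general recursion."""
    return (k - lag + 1, k + 1)


def fixed_lag(k: int, N: int) -> Tuple[int, int]:
    """k replaced by k-1, lag replaced by N: state at k-N given data up to k."""
    return estimate_times(k - 1, N)


def fixed_point(k: int, N: int) -> Tuple[int, int]:
    """lag replaced by k-N+1: state at the fixed time N given data up to k+1."""
    return estimate_times(k, k - N + 1)


def fixed_interval(k: int, N: int) -> Tuple[int, int]:
    """k replaced by N-1, lag replaced by N-k: state at k given data up to N."""
    return estimate_times(N - 1, N - k)


def filtering(k: int) -> Tuple[int, int]:
    """lag set to zero: state at k+1 given data up to k+1."""
    return estimate_times(k, 0)
```

For example, `fixed_lag(10, 3)` refers to the state at time 7 estimated from data up to time 10.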

The results presented here for linear estimation are new
and recursive in nature. If the stochastic processes are Gaussian, the
estimator is optimal for all reasonable cost functionals, and if they
are not, then the algorithm is the linear estimator minimizing the
error-variance.

6.4  CONCLUSION 

In this chapter, a new unified approach has been presented
to obtain the fixed-lag, fixed-point and fixed-interval smoothing
algorithms, as well as the filtering algorithms, for nonlinear and linear
delayed systems with non-delayed measurements or multichannel
delayed measurements corrupted by colored noise. The
derivation is straightforward and shows the close link between the various
smoothers and the filtering estimator.

Except for the problem of linear estimation for delayed systems
with non-delayed measurements, all the results obtained are new. In
that case the results agree well with those in the literature.

The  results  obtained  are  exact  and  optimal  with  respect 
to  the  given  constraint  on  the  dynamic  equation  of  the  smoother, 
minimizing  the  error-variance  cost  functional. 

The results can be recursively evaluated under the assumption
that the probability density functions of the estimator errors are
Gaussian; for systems with product-type or polynomial nonlinearities
they can be obtained without any further approximation. They are
expected to be computationally efficient, since there is no
augmentation of state variables.

The  results  can  also  be  extended  to  continuous  time  problems 
through  a  formal  limiting  procedure. 
CHAPTER VII


CONCLUDING  REMARKS 


7.1  Conclusions 

This  thesis  is  devoted  to  the  derivation  of  nonlinear 
estimation  algorithms  for  discrete  and  continuous  nonlinear  systems 
with  and  without  delays,  given  non-delayed  measurements,  as  well  as 
multichannel  time-delayed  measurements,  corrupted  by  white  Gaussian 
noise,  correlated  noise,  colored  noise  as  well  as  noise-free  processes. 
Even  though  concluding  remarks  have  been  made  in  Chapters  III  to  VI, 
it  is  the  purpose  of  this  section  to  summarize  the  main  results  of 
this  dissertation. 

In Chapter III, nonlinear filtering algorithms are derived
for continuous nonlinear systems without delays corrupted by white
noise, correlated noise and noise-free processes. For systems
corrupted by additive white noise, the derivation is new and exact;
in the case of non-white noise processes, the results obtained are
new but an approximation has been made. The main technique involves
the matrix minimum principle together with the Kolmogorov and
Kushner equations to minimize the error-variance, taken to be the
estimation criterion.

In order that the systems be physically realizable, the
assumption that the probability density functions of the estimator
errors are Gaussian is needed to expand the expectations involved in
the estimation algorithms. Various nonlinear systems with additive
white noise processes were simulated, and the results were
compared with those obtained from various other finite-dimensional
approximate nonlinear filters. The results clearly indicate the
superiority of the proposed minimum variance filter over the
other filters investigated, and theoretical explanations are given
for the apparently poor performance characteristics of the other filters
considered. In the particular case of linear systems, the results
agree well with those in the literature.

New nonlinear estimation algorithms for discrete nonlinear
delayed systems, with non-delayed measurements as well as multichannel
time-delayed measurements, corrupted by white noise, correlated
noise and colored noise processes are derived in Chapters
IV, V and VI, respectively. The estimation algorithms directly yield the
fixed-lag, fixed-point and fixed-interval smoothing algorithms and the
filtering algorithms, with proper substitution of the discrete time
indices. The proposed technique makes use of the concept of the gradient
matrix and the minimum principle to derive the optimal values of the
coefficients in the estimation algorithms under the requirements that the
estimates be unbiased and the error-variances minimized. The derivation
is straightforward and clearly indicates the close link between the three
different classes of smoothers and the filtering estimator.

The results obtained are exact and optimal with respect to
the imposed constraints on the dynamic equations of the estimators,
minimizing the error-variances. The algorithms derived can be
implemented on a computer under the assumption that the
probability density functions of the estimator errors are Gaussian.
For systems with product-type and polynomial nonlinearities, no
further approximation is needed to evaluate the expectations involved
in the algorithms. They are expected to be computationally efficient,
since no augmentation of state variables is involved.

The results can also be applied to various special cases of
nonlinear as well as linear estimation problems, for example: the
estimation problems for linear or nonlinear systems without delays,
the estimation of linear or nonlinear systems with multiple delays
appearing only in the measurements, and the linear estimation problems
of time-delayed systems corrupted by white noise as well as non-white
noise processes.

For linear estimation problems the results obtained are
identical to published results in the literature. In the particular
case of linear time-delayed systems corrupted by additive white
noise, the results are stable, but the stability behaviour of the
nonlinear estimators is yet to be investigated.

The results obtained in Chapters IV, V and VI can be
extended to continuous-time systems by a formal limiting argument [4].
7.2 Suggestions for Further Research

A unified approach has been developed to derive estimation
algorithms for various types of nonlinear estimation problems. But
many more nonlinear estimation algorithms need to be derived for
various other nonlinear systems, for example: noise processes involving
delays, signals measured through jump processes, multistage correlated
processes, and systems with jump effects or discontinuous nonlinearities.
Besides the problems considered in this thesis, estimation for continuous
systems with discrete measurements is yet to be developed. Nonlinear
smoothers for continuous nonlinear delayed systems with white noise
and non-white noise processes are yet to be derived by extension through
the formal limiting argument.

Besides deriving estimation algorithms, one also needs to
know more about the performance characteristics of the estimators; the
sensitivity, divergence and modelling problems are of extreme
importance. The problems of error compensation, modelling error and
error detection, data editing, data compression and data ordering in
real-system applications are yet to be considered. One very
important problem which requires immediate attention is the stability
behaviour of the estimators developed in this thesis.

Extensive computer simulation also needs to be undertaken
to test the estimators developed in Chapters IV to VI, and they are
to be compared with other finite-dimensional estimators, where one would
have used the extended Kalman filter to approximate the nonlinear
estimation problems.




Since the estimation technique developed here is simple,
straightforward and effective in dealing with wide-ranging nonlinear
systems, one would immediately think of its parallel development for
parameter estimation, identification and nonlinear mean-square
control problems. One can also think of the extension to estimation
problems for distributed parameter systems.

Last but not least, the nonlinear estimation algorithms are
bound to take over the role of linear estimators in the fields of
communications, guidance and control. Therefore, extensive research
and development work is yet to be carried out to employ the nonlinear
estimators in real-system applications.